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Ground truths 


Nature survey shows most scientists are happy at work, but that a significant number 
still face discrimination — an unacceptable situation. 


data to look for specific landmarks that instil confidence in 

the results. Some are defined by cold mathematics: 71% can 
be presented as “most people” without much controversy. But many 
others are subject to interpretation. Do 14 of 16 people constitute 
“almost everybody’, or does that take 15? 

Perhaps most important is how to handle the lower reaches. Can a 
single voice from 100 be written off as an outlier? What about two or 
three? How big does a minority have to become before it gets a bullet 
point in a report? The data — the numbers that surveys produce — 
usually tell the full story to those who are willing to look, but most 
(that word again) of us rely on a more human narrative to make sense 
of the results. And here, care is needed. 

This week, Nature publishes the results of our biennial survey of the 
income and career satisfaction of scientists across the world. And one 
narrative that emerges is heartening. Most — 68% — said they were 
satisfied or very satisfied with their careers. And just over half — 51% 
— had received a pay rise in the past year. 

The majority may rule, but it hardly tells the whole story. Nobody 
should take any comfort, for example, from the fact that most scien- 
tists (72%) told the survey they have not witnessed any instances of 
harassment or discrimination. The corollary of that figure is clear: 28% 
have. Nobody can be satisfied with that. The problem of harassment 
has received some much-needed attention in recent years, but, as these 
figures show, there remains much work to be done, and attitudes and 
behaviours still need to change. 

The survey has limitations. The results are based on the anonymous 
responses of 4,334 (self-selected) people who have pursued science 
beyond an undergraduate degree. Three-quarters of them are based 
in North America or Europe. Still, many of the figures do mirror those 
of other surveys — high levels of job satisfaction among scientists 
working across academia and industry, for instance. Researchers are 
generally a content and motivated bunch. But look in the margins and 
there remains much room for improvement. 

Poor mental health continues to be a huge concern, with more than 
one-third (36%) of respondents saying they needed or were receiving 
help for depression or anxiety. Attitudes from colleagues were not 
always supportive. “I had a mental health crisis and instead of helping 
I was suspended from work and threatened with potential dismissal,” 
wrote one. Many universities are aware of this issue and are working 
to improve care and support. But not all are succeeding. 

The survey reveals other institutional failings, too. Sadly, only 
half of university scientists said their institution was doing enough 
to promote diversity. And 21% said they had personally experienced 
harassment or discrimination. This was most commonly based 
on gender, but the list also included discrimination based on race, 
religion, sexuality and age. One respondent wrote: “Co-workers have 
scheduled important meetings on religious holidays and when I object 
or do not attend, I’m viewed as someone who doesn't take their job 


r | here is a tendency when drawing conclusions from survey 


seriously.’ Another said: “A liberal faculty will shun and even harass 
conservative Christians, mocking them openly.” 

Some 23% of people who replied to the survey reported discrimina- 
tion based on age. One respondent complained of “Pressure to retire 
as I approach age 60. Not explicit or stated, but moral pressure and 
looks? And about the same number (22%) said they had suffered 
racial bias. 

This is unacceptable. Science must do better on these issues, as 
individuals and institutions. The survey holds up a mirror to the 
research community, and if the community does not like what it sees 

— and it should not — then all of us must do 


“The survey more to change the picture. 

holds up a Science should be a rewarding career. 
mirror to Most scientists say they do enjoy their work 
the research and — at least according to this survey — 


most get through the day without being 
made to feel that they don’t belong, or that 
they have to do more to prove themselves because of their gender or 
geographical origin. But “most scientists” here is not enough. Indivi- 
duals and groups who do experience such abhorrent discrimination 
must know they are not an overlooked interest. It is everybody’s 
responsibility to condemn such behaviour when they see it. And, 
where they feel comfortable to do so, everybody should speak out 
when injustice occurs. m 


community.” 


@ e@ e@ 
Capital thinking 
Political attention to human capital must be 
backed up with solid research. 


Century, written by French economist Thomas Picketty, 

highlighted the role of wealth — rather than earnings — in the 
way that money makes the world go around. But Picketty chose to play 
down an important part of the system: human capital, the economic 
value derived from the knowledge, skills and abilities that enable 
people to perform paid work. 

How to include human capital in analyses is as much a political 
as an economic problem: critics argue that the concept creates a 
false equivalence between having skills and having money, which 
plays down financial inequality. Supporters insist that it’s a genuine 
measure of the potential of individuals, populations and nations, 
and so a way to indicate their intrinsic value. 

The World Bank has now reignited the debate. Earlier this month, 
it released its much-anticipated Human Capital Index (see go.nature. 


ik surprise 2014 global bestseller Capital in the Twenty-First 
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com/2cwyqqd), which ranks 157 nations according to measures of 
investment in their people. The bank’s measure is relatively simple, 
constructed from data on child survival and growth, years of primary 
and secondary schooling, and health. A country can achieve a perfect 
score if all children born today can expect to survive to 60 without 
impaired growth and development — resulting from poor nutrition, 
repeated infection or inadequate psychosocial stimulation, and meas- 
ured by ratios of height and age — and can expect to have received 
14 years of good-quality schooling by age 18. 

The index is based on the assumption that a country’s economic 
productivity is tied to the knowledge and abilities of its people. It 
followed the release, two weeks earlier, of the results of a parallel 
(but separate) academic exercise by the Institute for Health Metrics 
and Evaluation at the University of Washington in Seattle (S. S. Lim 
et al. Lancet 392, 1217-1234; 2018). 

The World Bank hopes its new index will mimic the success of its 
national “ease of doing business” ranking, which has focused govern- 
ment efforts around the world to reduce corruption and encourage 
outside investment as a way to secure a higher placing than their rivals 
and competitors. The bank wants to demonstrate how measures of 
education and health are linked to the productivity and prosperity 
of a country, assuming that investing in human capital through edu- 
cation and health systems can yield rapid development. In short, it 
wants to push countries to make things better for their people — and 
their human capital ranking. It has certainly managed to draw atten- 
tion: Indian officials immediately protested against their country’s low 
ranking, and government officials there say they will ignore what they 
argue to be a simplistic and misleading measure. 

Top scorers on the World Bank list include Singapore, South Korea 
and Japan, whereas many African countries, including Mali, Nigeria 
and Liberia, performed poorly and were near the bottom of the index. 

The Institute for Health Metrics and Evaluation based its ranking of 
195 countries on similar factors, but incorporated more measures of 
health and education, and used different data sources and methods. Fin- 
land, Iceland and Denmark top its charts, which cover the period from 


1990 to 2016. During this time, the United States tumbled from 6th to 
27th place, largely owing to minimal progress in educational attainment. 
Few would argue against the goal of encouraging better health and 
education. And perhaps by framing these needs in terms of economic 
returns and tapping into the political desire to climb the leader boards, 
these measures might succeed in having a greater impact on decision- 
makers than do simple appeals to the intrinsic good. For example, one 
way to improve a country’s position would 


“These indices be for it to reduce gender inequality in years 
areonlyas good of schooling. 
as the data that But any metric — be it a university rank- 


underliethem.” ing or standardized mathematics testing — is 
selective and must be interpreted appropri- 
ately. Too often it becomes a convenient proxy, leading to inferences 
of quality for which it was never intended, and distorting reality. As 
in most analyses of this type, these indices are only as good as the data 
that underlie them. There is a huge range in the quality and quantity 
of data on both health and education across countries. And although 
deductions about the exact effects of health outcomes and education 
on economic productivity are based on research, the true relation- 
ships are unclear for the range of countries and contexts to which 
the Human Capital Index is being applied. Critics are right to point 
out that a national score does not account for regional differences in 
a country. 

Scientists can play a part here, to ensure that indices such as these 
become the credible motivators that they are intended to be. More 
and better data on indicators of health and educational outcomes will 
improve the accuracy of the indices. More research on rigorous ways to 
capture other determinants of human capital, and on their relationship 
to health, prosperity and well-being, will enrich our understanding of 
how to reach global development goals. Nature recognizes the need 
for such work to help inform policymakers and make their efforts 
more evidence-based. As such, we encourage submissions of high- 
quality data and analysis addressing knowledge gaps in assessing and 
improving human health and well-being. m 


Matters Arising: a 
venue for commentary 


here was a time when scientific progress depended on elaborate 

and often protracted exchanges of correspondence. Charles 
Darwin wrote thousands of letters, and his correspondence with 
influential thinkers had an important impact on his theories. 
This communication was private. Fortunately, much has survived 
and found its way into archives, where it forms a key part of the 
scientific record. 

Although research findings today are mainly disseminated and 
recorded in the form of peer-reviewed research manuscripts, schol- 
arly commentary on published research is still crucial: it can pro- 
vide nuance, refinement and caveats. And these days, it moves fast. 

So, from this week, Nature will consider such post-publication 
contributions as Matters Arising — a format designed to peer- 
review and publish online exceptionally interesting and timely 
scientific comments and clarifications related to primary research 
papers published in the journal. Authors of the original papers 
will be given the chance to reply. If our editors deem that these 
responses move the discussion forward in a constructive way, they 
will be published at the same time as the Matters Arising article. 

We also recognize the need for timely release of these exchanges 
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to the relevant communities, and the difficulty of doing so through 
an often-lengthy peer-review process. So, to accommodate both 
rigorous peer review and the need for timeliness, authors of Matters 
Arising and the original Nature paper are encouraged to release 
preprints during the formal journal process, as supported by 
our policies. Comments can also be made on all original Nature 
research papers online, and these can be linked to relevant com- 
mentaries, to articles published elsewhere and to relevant preprints. 

Decisions to publish Matters Arising will be taken by journal 
editors. To ensure the integrity of the published record, and to help 
readers find all relevant information, published Matters Arising 
articles will be linked to the online version of the original paper 
and to the original authors’ response. This format will replace 
Brief Communications Arising as an avenue for post-publication 
commentary on primary research. 

Over the coming months, we plan to introduce the Matters 
Arising format to the other Nature Research journals, where it 
will replace Correspondence for such discussions. In this way, we 
aim to offer a standardized formal mechanism and a constructive 
peer-review process for post-publication commentary. This should 
allow debate on published papers in the journals’ online pages, and 
provide visibility and credit for authors engaged in these debates. 

All current policies on competing interests, authorship standards 
(including joint authorship) and author contributions, availability 
of data, materials and code (where relevant), and publication of the 
reporting summary will apply to Matters Arising and any published 
reply from the original authors. m 


Limited. All rights reserved. 
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steam engine and electricity have done. But unlike past techno- 
logical revolutions, the AI revolution offers a unique chance to 
improve lives without opening up and exacerbating global inequalities. 

That will require widening of the locations where AI is done. The 
vast majority of experts are in North America, Europe and Asia. 
Africa, in particular, is barely represented. Such lack of diversity can 
entrench unintended algorithmic biases and build discrimination into 
AI products. And that’s not the only gap: fewer African AI researchers 
and engineers means fewer opportunities to use AI to improve the lives 
of Africans. The research community is also missing out on talented 
individuals simply because they have not received the right education. 

Iam happy to be returning to Africa as part of a chance to change 
that. Next month, Internet-technology com- 
pany Google will open an AI Research Lab in 
Accra, Ghana, which will be the first of its kind 
on the continent and which I will lead. We plan 
to employ many engineers and scientists. 

About 15 years ago, I started undergraduate 
studies in maths and physics in my native Sen- 
egal, and began to teach myself AI using courses 
downloaded from the Internet. I went to Paris 
to finish my graduate studies and then took up 
a position in Facebook’s AI research division, 
working to make machine intelligence fair, trans- 
parent and more reliable. 

There are many obstacles to an Al researcher 
from Africa making it into the global commu- 
nity. Ata 2016 conference in Barcelona, Spain, attended by more than 
5,000 people, I was one of fewer than 10 black people. In response, I 
co-founded Blackin AI. It is now a thriving community of more than 
1,000 students, researchers and AI enthusiasts ready to share ideas and 
foster collaboration to increase representation of black people. Despite 
the support, many of us still have trouble making it to conferences. 
I have had papers accepted at meetings but been unable to attend 
because Western countries such as Australia denied me a visa, even 
though I was already settled and working professionally in Europe. 

We need more efforts to overcome these barriers and to ensure that 
the benefits of AI arrive globally. Many of the essential ingredients 
are already in, or coming into place. The human resources are there. 
Africa is home to the youngest and fastest-growing population on 
Earth. Iam 33 years old, and that makes me older than most of the 
continent’s inhabitants (the median age in Africa is 19; in the Euro- 
pean Union, 43). Enthusiasm is huge. Last year, the Deep Learning 
Indaba gatherings across Africa hosted 300 students from 23 African 
countries, and had to turn down more applicants than it could accept. 

Financial resources are also becoming available. Last year, ven- 
ture capitalists poured US$560 million into tech start-ups in Africa. 
Google is supporting and advising more than 60 through Launchpad 
Accelerator Africa. According to the International Monetary Fund 


A rtificial intelligence (AI) is changing society as profoundly as the 
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GIVE 
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Al map to mobilize human and financial resources, 
a 


Look to Africa to advance 
artificial intelligence 


If Alis to improve lives and reduce inequalities, we must build expertise 
beyond the present-day centres of innovation, says Moustapha Cisse. 


in Washington DC, six of the ten counties with the fastest-growing 
economies are in Africa. 

There is a strong support among AI researchers. Last month, the 
African Masters of Machine Intelligence degree programme, which I 
organized with sponsorship from Facebook and Google, started courses 
with 30 students. More than 30 global experts have agreed to come to the 
African Institute of Mathematical Sciences in Kigali, Rwanda, to teach 
it. Ifthe quality ofa programme is judged by the quality of the lecturers, 
this is the best curriculum in the world. 

The next step is to develop a coordinated plan to encourage AI educa- 
tion across the continent, incentivize entrepreneurship in the AI sector, 
and facilitate collaboration between AI researchers and experts in health 
care, agriculture and other sciences. We need a pan-African strategy: a 
set of ambitious goals for AI education, research 
and development and industrialization. 

African nations must forge ties with their local 
Al expertise. Other countries, including China 
and France, have benefited from their country- 
specific initiatives. Canada, in particular, has 
cultivated a good synergy between a strong AI 
research community and receptive political lead- 
ership. As a consequence, the major tech compa- 
nies all have centres in Montreal. 

Another essential step is setting out a road 


including a pan-African AI fund to support coor- 

dinated efforts. A network of African institutes of 

artificial intelligence, for example, could retain 
the best talents on the continent, enlist world-class African scientists to 
tackle AI challenges in the African context and collaborate with existing 
academic institutions. The network could also support policymakers 
and collaborate with the private sector. 

Finally, African governments must create a standard legal frame- 
work and a set of values that will help to ensure that AI in Africa serves 
the good of humanity. There is a growing fear around the world about 
nefarious uses of AI. Fortunately, initiatives to prevent such uses are 
emerging. In the public sector, the European Commission this April 
outlined an approach to setting ethical guidelines for AI. In the private 
sector, Google published a set of standards this June to govern its AI 
research and product development. 

Now is the time to build a foundation that ensures that AI helps 
bring better lives in Africa and beyond. With foresight and planning, 
the technological revolution that AI brings will be a force to empower 
a fair and prosperous society. The oft-overlooked continent has much 
to give and to get from AI. = 


Moustapha Cisse is head and co-founder of the Google AI Research 
Lab in Accra, Ghana, and professor of machine learning at the African 
Institute of Mathematical Sciences. 

e-mail: moustaphacisse@google.com 
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NIH chimps 


Elderly and sick chimpanzees 
owned by the US National 
Institutes of Health (NIH) 
might spend their retirement 
at research facilities instead 

ofa federal sanctuary if they 
are too frail to relocate, the 
agency said on 18 October. The 
NIH decided to retire all their 
chimps from research in 2015. 
Director Francis Collins says 
that the agency will develop 
guidelines to determine 
whether chimpanzees at 
facilities owned or supported 
by the NIH are well enough to 
move. The criteria will include 
assessments of each animal’s 
health, behavioural, social 

and environmental needs. Ifa 
research facility and the federal 
sanctuary — called Chimp 
Haven, in Keithville, Louisiana 
— cannot agree on whether 

an animal should be relocated, 
Collins says, the agency will 
have a panel of independent 
veterinarians inform the final 
decision. 


Transparency rule 
The US Environmental 
Protection Agency (EPA) has 
delayed a decision on whether 
to implement a controversial 
rule that would limit the types 
of scientific research that can be 
used to justify environmental 
regulations. The proposed rule, 
unveiled in April by then-EPA 
administrator Scott Pruitt, 
would prevent the agency from 
basing regulatory decisions 

on studies whose underlying 
data are not publicly 

available — a requirement 

that could eliminate a lot of 
epidemiological and other 
health research. These studies 
often use confidential patient 
data that cannot be made 
public owing to privacy 
concerns. In a decision quietly 
released on 17 October, 

the agency delayed a final 
determination on the rule until 
at least 2020. The proposed 


The news in brief 


BepiColombo heads off to Mercury 


A joint Japanese—European mission that is 

set to be the second ever to enter Mercury’s 
orbit launched successfully on 19 October 
from Kourou, French Guiana. The €1.6-billion 
(US$1.8-billion) mission, called BepiColombo, 
will insert two probes into the planet’s orbit 

in 2025, after several fly-bys of Earth, Venus 
and Mercury itself. One probe, built mainly by 
the European Space Agency (ESA), will study 


tule faced criticism from 
researchers earlier this year, 
and the agency’s scientific 
advisory board voted to review 
the proposal in May. 


Vaccine firm fined 
A Chinese pharmaceutical 
company caught producing 
faulty rabies vaccines 

has been fined billions of 
yuan by national and local 
agencies. China's national 
drug regulator announced 
on 16 October that it will fine 
Changchun Changsheng 
Biotechnology 12 million 
yuan (US$1.7 million), and 
revoke the company’s licence 
to make rabies vaccines. 
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The regulator determined 

that when the company 
created several batches of 
faulty rabies vaccine, it broke 
multiple laws, including: using 
expired products to make 

the vaccine; not testing the 
potency of vaccines according 
to prescribed methods; and 
destroying evidence to cover up 
its actions. The drug regulator 
of Jilin province, where the 
company’s headquarters are 
based, has issued the firm with 
a much larger fine: 7.21 billion 
yuan. The local regulator has 
revoked the company’s licence 
to produce pharmaceuticals 
and has banned 14 executives 
and others involved in the 


Mercury’s surface and its inner structure. The 
other, built by the Japan Aerospace Exploration 
Agency, will focus on the planet's magnetic 
field and its interaction with the solar wind. 
Hours after launch, BepiColombo successfully 
deployed its antennas and two 15-metre-long 
“solar wings’, ESA said. The craft also took 
several “space selfies” using three on-board 
monitoring cameras. 


incident from engaging in 
drug production in the future. 
The company has not publicly 
responded to the penalties, but 
in a statement to its investors, 
the board said it would set up a 
group to handle compensation 
claims. 


Brexit warnings 
World-leading scientists and 
mathematicians from across 
Europe have called on UK and 
European Union leaders to 
maintain the “closest possible 
cooperation’ on science after 
Brexit, and have warned 

that any barriers to research 
collaboration in Europe will be 
to the detriment ofall. The calls 


ESA/S.CORVAJA 


were made in a letter signed 

by 29 Nobel laureates and 

= 6 recipients of the prestigious 
Fields medal for mathematics, 
& and sent to UK Prime Minister 
5 Theresa May and European 
Commission president Jean- 
Claude Juncker on 19 October. 
They come as the United 
Kingdom approaches its March 
2019 deadline for leaving 

the EU, and amid stalling 
negotiations on the country’s 
future relationship with the 
bloc. Brexit could see the end 
of Britain’s participation in 

EU research programmes — 
although UK leaders have 
indicated a desire to be part of 
future initiatives. The letter says 
that the United Kingdom must 
step up its commitment if it 
wants to stay involved. 


Science mega-prize 


The inventor of a revolutionary, 
high-resolution microscopy 
technique was one of six 

big winners of this year’s 
Breakthrough prizes, 
announced on 17 October. 
The awards, each worth 

US$3 million, are given out 
annually in the life sciences, 
mathematics and fundamental 
physics. Xiaowei Zhuang 
(pictured), a biophysicist 

at Harvard University in 
Cambridge, Massachusetts, 
received one of the four 


R/HHMI 


L SEI 


life-sciences prizes for leading 
the development of stochastic 
optical-reconstruction 
microscopy — known as 
STORM — just over a decade 
ago. The technique was one of 
the first to break a fundamental 
resolution limit of conventional 
light microscopy, and is now 
used widely by biologists to 
reveal the hidden molecular 
structures of cells. Charles 
Kane and Eugene Mele at the 
University of Pennsylvania 

in Philadelphia won the 
fundamental-physics prize 

for their work predicting the 
existence ofa type of exotic 
material known asa topological 
insulator. See go.nature. 
com/2nxeyvs]1 for a full list of 
winners. 


Physicist retires 


Lawrence Krauss will retire 
from his position as a professor 
at Arizona State University 
(ASU) in Tempe in May 2019. 


SOURCE: NASA 


‘TREND WATCH | 


The BepiColombo probe, 
launched on 19 October, will 

be only the third mission to 
approach Mercury. Meanwhile, 
NASA is planning the landing 
site for its next rover to Mars, 

a planet that has been targeted 
around 50 times. So which 

of our Solar System's planets 
proved most popular with 
space scientists — and why? 
Since planetary missions began 
in the 1960s, each planet has 
had at least a flying visit from a 
spacecraft — but the vast bulk 
of attention has been focused 
on just a few. Mars and Venus 
top the list, partly owing to their 
proximity and to interest in their 


past conditions — water on Mars 
and Venus'’s previously more 
Earth-like climate. Mercury has 
been neglected among the inner 
planets because it is challenging 
to reach. Its proximity to the 

Sun means that craft need to 
slow down considerably before 
they enter the planet's orbit. 
Visiting Uranus, Neptune and 
the dwarf planet Pluto is also 
technically challenging, but some 
astrophysicists say that to truly 
understand the Solar System, 
humans will need to step up 
exploration of these bodies and 
their moons. Scientists are also 
planning to send two orbiters to 
Jupiter’s moons. 


Earlier this year, the university 
began investigating allegations 
of sexual harassment against 
the physicist, allegations 

that Krauss denies. He has 
been on administrative leave 
since early March, following 
news reports of alleged 
harassment. The university 
accepted Krauss’s request to 
retire, an ASU spokesperson 
said on 21 October. Krauss 
founded and led ASU’s Origins 
Project on cosmic questions 
for nearly a decade; it is now 
being transformed into an 
Interplanetary Initiative to 
focus on the future of humans 
in space. 


HEALTH 


Ebola outbreak 

The World Health 
Organization (WHO) declared 
on 17 October that the Ebola 
virus outbreak in the northeast 
of the Democratic Republic of 
the Congo (DRC) — although 
extremely worrying — is not 

a ‘public health emergency 

of international concern. 

The organization said that 
declaring the status would 

not significantly improve an 
already robust containment 
effort. The outbreak began in 
early August and is centred 
around the city of Beni. As of 
21 October, there have been 
203 confirmed cases and 

35 probable cases — including 


PLANETARY VISITS 


SEVEN DAYS | THIS WEEK | 


153 deaths. The outbreak is the 
tenth in the country’s history 
and has been complicated 

by armed conflicts in the 
region that are hampering 
containment efforts. The 
WHO noted the DRC’s long 
experience in handling Ebola 
episodes but said that ongoing 
vigilance is crucial. 


Lassa fever 

Rats fuelled the largest 
outbreak of deadly Lassa fever 
in Nigeria this year, according 
to the most extensive and rapid 
genomic analysis of the Lassa 
virus conducted so far. The 
study, published on 17 October, 
eases fears that Lassa had 
mutated into a super-virus 
that was spreading swiftly 
between people (K .J. Siddle 

et al. N. Engl. J. Med. http://doi. 
org/gfdc4v; 2018). Instead, the 
viral genomes harvested from 
220 patients were surprisingly 
diverse, indicating that most 
people had not acquired their 
infections from someone else. 
The unprecedented speed. 

of this analysis has helped 
officials to combat the spread 
of Lassa fever, and the virus’s 
genetic information will assist 
researchers as they develop 
vaccines against the illness. 
About 514 people fell ill with 
the disease between January 
and September, and 134 of 
them have died. See go.nature. 
com/2r9vl8x for more. 


Nature counts up past, present and future missions to planets and 
dwarf planets in the Solar System — including failed missions and 


visits to moons other than our own. 


60 
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Could British universities maintain access to EU research funds even with a ‘no-deal’ Brexit? 


RESEARCH COLLABORATION 


UK universities go for Brexit 
gambit to safeguard funds 


British institutions set up EU outposts with continental counterparts. 


BY QUIRIN SCHIERMEIER 


S™ of Britain’s leading research 


institutions are establishing alliances 
with counterparts in other Euro- 
pean countries — a move that might allow 
them to keep drawing on the European 
Union's science funds even in the case of a 
‘no-deal’ divorce from the bloc, the most 


extreme form that Brexit could take. 

To access the €100 billion (US$115 billion) 
in research funding that the EU proposes to 
make available for 2021-27, scientists must be 
based at host institutions that are legal entities 
in the EU or associated countries. That might 
soon cease to be the case for UK universities, 
many of which stand to lose tens of millions of 
euros that they get from EU funds at present 


(see ‘Brexit’s high stakes’). The outcome 
depends on the terms of Brexit, which are the 
focus of intense negotiations. 

“In principle, UK entities might remain 
eligible for funding under EU criteria if 
they have a legal presence in an EU member 
country,’ says Jan Palmowski, director of the 
Brussels-based Guild of European Research- 
Intensive Universities. For example, > 
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BREXIT’S HIGH STAKES 


Many UK universities stand to lose tens of millions of euros in research funding from EU framework 
programmes. These are the top five UK universities in terms of income from the Horizon 2020 programme. 


University of Oxford 
University of Cambridge 


University College London 


Imperial College London 
University of Edinburgh 


100 


> recipients of grants from the European 
Research Council must spend at least 50% of 
their time at a host institute in the EU or asso- 
ciated country. So, continental outposts could 
help UK researchers to continue to access 
those grants, even if their country ceases to 
officially receive them. 

Palmowski says that stable alliances 
with continental partners might also help 
UK universities to safeguard EU-funded 
research collaborations and student exchanges. 
The idea that fruitful research relations built 
over decades might go to pieces is “dismay- 
ing and heartbreaking”, says James Conroy, 
vice-principal for internationalization at 
the University of Glasgow, which hopes to 
establish such partnerships. 


OXFORD AND BERLIN 

Of several alliances launched in recent 
months, a partnership between the University 
of Oxford and four institutions in Berlin is so 
far the most comprehensive. Established at 
the end of 2017, the Oxford—Berlin Research 
Partnership is mainly financed by the Berlin 
state government and private sponsors. This 
year, the alliance launched a pilot call for 


Number of projects 427 
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350 


proposals and made €10,000-30,000 available 
in seed grants, with the intention of raising 
additional third-party funding. Any faculty 
members of the five institutes can apply. A sec- 
ond call is to be announced next month. Cru- 
cially, the partnership will serve as Oxford’s 
legal entity in Germany, and will provide an 
administrative office at the university clinic 
Charité in Berlin for visiting researchers. That 
means, at least in theory, that some Oxford- 
based researchers might be able to access 
EU funding. Berlin has also promised to pro- 
vide space for visiting Oxford scholars in its 
Natural History Museum. 

The likely cost of running the partnership 
will be around €800,000 a year, says Alastair 
Buchan, a pro-vice-chancellor and head of 
Brexit strategy at Oxford and director of the 
university’s Berlin office. And he estimates 
that this will further enable many millions 
of euros of research projects and activity. 
“We're finally doing what we should have 
done since the day the UK joined the EU 
in 1973,” says Buchan. “We took the free- 
dom to collaborate without restrictions for 
granted. It was only when the Brexit refer- 
endum came along that we began to realize 


that we must insure against the future.” 

Oxford and Berlin will both benefit from the 
partnership, says Steffen Krach, state secretary 
for higher education and research in the Berlin 
state government. “Obviously, future access to 
EU funding for joint research is part of the 
motivation for Oxford to set up shop here, and 
quite legitimately so,” he says. “But we can also 
learn a lot from Oxford and their success in 
scouting international talent. Science in Berlin 
will doubtless benefit in terms of research out- 
put and reputation from lively exchange with 
one of the best universities in the world.” 


ACADEMIC ALLIANCES 

A host of other similar partnerships are at 
various stages of development. Institutions 
involved include the University of Warwick 
and Northumbria University in Newcastle, 
as well as the University of Glasgow. Last 
month, Imperial College London announced 
an expansion of its long-standing research- 
and-education partnership with the Techni- 
cal University of Munich in Germany. “We're 
naturally interested in any mechanism that 
allows us to continue fruitful collaborations 
we have established with European partners 
over the decades,’ says Maggie Dallman, 
vice-president of Imperial College. 

EU funding is one way of easing collabora- 
tion, but any mechanism to keep doors open 
in science must be transparent, says Dallman. 
“We are not seeking to find opaque backdoor 
routes to getting European funding,” she says. 
“Tt’s ultimately all about doing more research of 
a higher quality with an outstanding partner.’ 

Conroy says: “Brexit will not leave 
UK universities unaffected, but we managed 
to live through turmoil before” He adds: “No 
matter how difficult the political crisis is, we 
will see to it that our faculty and students, 
and society at large, continue to get the best 
possible scholarship and science.” = 


SOURCE: EUROPEAN COMMISSION 


PLANETARY SCIENCE 


Mars scientists push 
for ‘mega-mission’ 


Experts want NASA’s next rover to harvest rock at two sites. 


BY ALEXANDRA WITZE 


ASAs next Mars rover — the first to 

N gather rock samples meant to come 

back to Earth — should dream big 

and visit as many places on the red planet as 
possible, scientists concluded on 18 October. 

The rover’s stops would probably include 


some combination of Jezero crater, once home 
to river deltas and a lake; Northeast Syrtis, 


which contains some of the most ancient 
rocks on Mars; and Midway, a compromise 
option located between the two (see ‘Road 
Trip’). Project scientists have proposed visiting 
both Jezero, for the river and lake sediments 
that might retain signs of past life, and Mid- 
way, for the ancient rocks. The two are about 
28 kilometres apart — so visiting both would 
be ambitious but achievable. 

“The community prefers a mega-mission,” 
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says Bethany Ehlmann, a planetary scientist at 
the California Institute of Technology in Pasa- 
dena. “If we're going to do sample return, it has 
to be a sample cache for the ages.” 

The Columbia Hills region, which NASA’ 
Spirit rover explored between 2004 and 
2011, ranked much lower in the scientists’ 
poll despite having silica deposits similar to 
those formed by hot springs. “Everybody 
sort of thought we should go to a new place,” 
says Matthew Golombek, a Mars scientist at 
NASA's Jet Propulsion Laboratory (JPL) in 
Pasadena. 

The decision about where to send the 2020 
rover ultimately rests with NASA’s science 
chief, Thomas Zurbuchen, who will choose in 
the coming months. “I would be excited about 
any sample back,” says Meenakshi Wadhwa, a 
planetary scientist at Arizona State University 
in Tempe. “But we have the luxury of being 
able to choose between good sites.” 


SOURCE: NASA/JPL 


Slated to launch in July 2020, the 
US$2.4-billion rover will be the first from any 
nation to collect Mars rocks and stash them for 
a future mission that would bring them back 
to Earth. The geology of the landing site has to 
be intriguing enough — and the potential for 
scientific discoveries there great enough — to 
make the mission worth the investment. 

NASA has not planned how it would retrieve 
the rocks collected by the 2020 rover. But the 
agency gathered Mars experts in Glendale, 
California, from 16 to 18 October to hash out 
the merits of four finalists for the landing site. 

Jezero, Northeast Syrtis and Midway came 
remarkably close to one another in votes by 
169 scientists at the workshop. The researchers 
ranked the sites using several criteria, such as 
the potential of samples collected at each site to 
answer crucial scientific questions about Mars. 

The idea of visiting Jezero and then Mid- 
way — or the other way around — emerged 
in the past year as mission scientists debated 
how to get the most out of the rover’s journey. 
“Tt is ambitious as heck,’ says John Mustard, 
a planetary scientist at Brown University in 
Providence, Rhode Island. Midway’s ancient 
rocks are similar to those at Northeast Syrtis 
and near the rivers-and-lake system at Jezero. 

Sending a rover to Jezero and Midway would 
mean gambling that the vehicle would last long 
enough to reach both sites. Its primary mis- 
sion is 1.25 Mars years (2.35 Earth years); dur- 
ing that time, it is expected to travel roughly 
15 kilometres. That would get the rover around 
most of the Jezero site, if it started there, and 
possibly even to the crater’s rim. But it would 
then face a trek across dunes to Midway. 

NASAs Curiosity rover, the agency's biggest 
and most powerful so far, has travelled more 
than 19 kilometres since it landed on Mars in 
2012. The engineers developing the 2020 rover 


ROAD TRIP 
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Mars researchers want NASA to send its next rover to two sites on 
the red planet to collect rocks for return to Earth. The agency is 

now considering three potential areas for the rover to visit, shown 
here (inset) surrounded by the landing sites of previous missions. 


expect it to be able to travel faster than Curi- 
osity, in part because of new technology that 
improves its ability to navigate on its own. 
One major question is how many rock sam- 
ples the rover will collect, and from where. 
The 2020 rover is equipped with 42 sample 
tubes, 5 of which will be reserved as spares. 
That leaves 37 tubes to be filled with the most 
precious extraterrestrial rocks ever collected. 
“Sooner or later, somebody is going to have 
to decide whether these samples are worth 
bringing back,” project scientist Ken Farley, 
of JPL, told the meeting. “I don’t want to fail 
because we have not been ambitious enough.” 
At the workshop, project scientists laid out 
options for what might fill those 37 tubes. 
These include chunks of lake deposits from 
Jezero, fragments of enormous blocks of rock 
at the crater rim there and samples of the 
ancient rocks at Midway. The nuclear-pow- 
ered rover has several possible paths by which 


to navigate the 28 kilometres of dune fields 
between Jezero and Midway. Driving that dis- 
tance would take an estimated 401 Martian 
days, says deputy project scientist Katie Stack 
Morgan at JPL. 

Still unknown is where the rover might 
stash its precious samples. One possibility is 
that it could collect two similar sets of samples 
at Jezero, depositing one there and carrying 
the other on to Midway, Farley told the meet- 
ing. That would leave open the possibility of 
retrieving the samples at Jezero if something 
went wrong with the rover on its way to Mid- 
way. Other researchers back a Midway-to- 
Jezero journey, to get the ancient rocks first. 

NASA has not yet decided whether or how it 
might fetch the samples, although it has tenta- 
tive plans for a mission in the late 2020s. “We're 
actually serious about bringing these samples 
back,” Zurbuchen told the meeting. “That’s 
what we're here for.” m 


A moral map for AI cars 


Survey reveals global variations in ethical rules of the road for autonomous vehicles. 


BY AMY MAXMEN 


hen a driver slams on the brakes to 
avoid hitting a pedestrian crossing 
the road illegally, she is making a 


moral decision that shifts risk from the pedes- 
trian to the people in the car. Self-driving cars 
might soon have to make such ethical judge- 
ments on their own — but settling on a univer- 
sal moral code for the vehicles could be a thorny 
task, suggests a survey of 2.3 million people 
around the world. 

The largest-ever survey of machine ethics’, 
published this week in Nature, finds that many 


of the moral principles that guide a driver's 
decisions vary by country. For example, ina sce- 
nario in which some combination of pedestri- 
ans and passengers will die in a collision, people 
from relatively prosperous countries with strong 
institutions, such as law enforcement, were less 
likely to spare a pedestrian who stepped into 
traffic illegally. 

“People who think about machine eth- 
ics make it sound like you can come up with 
a perfect set of rules for robots, and what we 
show here with data is that there are no univer- 
sal rules,’ says study co-author Iyad Rahwan, 
a computer scientist at the Massachusetts 


Institute of Technology in Cambridge. 

The survey, called the Moral Machine, laid 
out 13 scenarios in which someone's death was 
inevitable. Respondents were asked to choose 
who to spare in situations that involved a mix 
of variables: young or old, rich or poor, more 
people or fewer. 

People rarely encounter such stark moral 
dilemmas, and some critics ask whether the 
scenarios posed in the quiz are relevant to the 
ethical questions surrounding driverless cars. 
But the study’s authors say that the scenarios 
stand in for the subtle moral decisions that 
drivers make every day. The findings reveal > 
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MORAL COMPASS 


A survey of 2.3 million people worldwide 
reveals variations in the moral principles 
that guide drivers’ decisions. Respond- 
ents were presented with 13 scenarios, in 
which a collision that killed some 
combination of passengers and 
pedestrians was unavoidable, and asked 
to decide who they would spare. 
Scientists used these data to group 
countries and territories into three groups 
based on their moral attitudes. 
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> cultural nuances that governments and mak- 
ers of self-driving cars must take into account if 
they want the vehicles to gain public acceptance, 
they say. 

“It’s a remarkable paper,” says Nicholas 
Christakis, a social scientist at Yale University 
in New Haven, Connecticut. The debate about 
whether ethics are universal or vary between 
cultures is an old one, he says, and now the 
“twenty-first-century problem” of how to pro- 
gram self-driving cars has reinvigorated it. 

Some of the world’s biggest tech companies 
— including Google, Uber and Tesla — and car- 
makers now have self-driving-car programmes. 
Many of these companies argue that the vehi- 
cles could improve road safety and ease traffic, 
but social scientists say the cars raise complex 
ethical issues. 

In 2016, Rahwan’s team stumbled on a para- 
dox about self-driving cars’: in surveys, people 
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say they want an autonomous vehicle to protect 
pedestrians, even ifit means sacrificing its pas- 
sengers — but also that they wouldnt buy self- 
driving vehicles programmed to act in this way. 

Curious to see whether the prospect of self- 
driving cars might raise other ethical conun- 
drums, Rahwan gathered psychologists, 
anthropologists and economists to create the 
online Moral Machine quiz. Within 18 months, 
it had recorded 40 million decisions made by 
people from 233 countries and territories. 

No matter their age, gender or country of 
residence, most people spared humans over 
pets, and groups of people over individuals. 
These responses are in line with rules proposed 
in what might be the only governmental guid- 
ance on self-driving cars: a 2017 report by the 
German Ethics Commission on Automated and 
Connected Driving. 

But agreement ends there. When the authors 


analysed answers from people in the 130 coun- 
tries with at least 100 respondents, they found 
that the nations could be divided into three 
groups (see ‘Moral compass’). One contains 
North America and several European and other 
nations where Christianity has historically 
been the dominant religion; another includes 
countries such as Japan, Indonesia and Paki- 
stan, which have strong Confucian or Islamic 
traditions. A third group consists of Central 
and South America, as well as France and for- 
mer French colonies. The first group showed a 
stronger preference for sacrificing older lives to 
save younger ones than did the second group, 
for example. 

Test versions of autonomous cars are cruising 
through several US cities. By 2021, at least five 
manufacturers hope to have self-driving cars 
and trucks in wide use. 

Bryant Walker Smith, a law professor at the 
University of South Carolina in Columbia, says 
that the study is unrealistic because there are few 
instances in real life in which a vehicle would 
face a choice between striking two different 
types of person. “I might as well worry about 
how automated cars will deal with asteroid 
strikes. he says. 

But Barbara Wege, who heads a group 
focused on autonomous-vehicle ethics at the 
car manufacturer Audi in Ingolstadt, Germany, 
says that such studies are valuable. Wege argues 
that self-driving cars would cause fewer acci- 
dents, proportionally, than human drivers do 
each year — but that events involving robots 
might receive more attention. 

Surveys such as the Moral Machine can help 
to prompt public discussions about inevitable 
accidents, and so might foster trust. “We need 
to come up with a social consensus,” she says, 
“about which risks we are willing to take. m 
1. Awad, E. et al. Nature https://doi.org/10.1038/ 


s41586-018-0637-6 (2018). 
2. Bonnefon, J. et al. Science 352, 1573-1576 (2016). 
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DISASTER MANAGEMENT 


Data hint at quake forecasts 


Italian-earthquake analysis suggests possibility of predicting aftershocks of some quakes. 


BY KATE RAVILIOUS 


between August and late October. Now, an 

analysis suggests the mechanism that might 
make such quakes unfold over a period of days 
or weeks, rather than as a single strike. The 
conclusion has stirred up excitement among 
earthquake researchers, because it raises the 
possibility that seismologists could make life- 
saving forecasts of the big quakes that follow the 
first, large quake in a sequence. But challenges 


I n 2016, three deadly earthquakes struck Italy 


remain, including how best to communicate the 
risk to people who might be affected. 

Currently, seismologists can forecast earth- 
quakes only in vague terms — say, estimating a 
30% chance of one ina large region in the next 
50 years. Most earthquakes take the form ofa 
single large quake followed by aftershocks of 
decreasing size. But in ‘sequence quakes, such 
as the 2016 Italy event, energy is released in 
a stop-start manner: several large quakes are 
interspersed with smaller aftershocks. Scien- 
tists aren't sure why this happens. 
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The latest research — which was published in 
August and will be presented at the American 
Geophysical Union Fall Meeting in December 
in Washington DC — suggests that the answer 
might lie in the interplay of faults and the move- 
ment of underground fluids (R. J. Walters et al. 
Earth Planet. Sci. Lett. 500, 1-14; 2018). This 
knowledge could, in theory, be used to predict 
potentially deadly follow-up quakes. 

Sequence quakes occur in all tectonically 
active areas of the world, but they are thought 
to be more prevalent in geologically young 


> fault systems. In Italy’s Appennine mountains, 
& which run the length of the country, these 
quakes occur every few decades, most recently 
in 2016, 1997 and 1979. More than 300 people 
died between 24 August and 30 October 2016 as 
aresult of the three earthquakes that hit central 
Italy, each larger than magnitude 6. The small, 
x historic town of Amatrice was badly damaged 
= by the first quake, and 299 people died. 
“Essentially, we can consider sequence quakes 
as ‘failed’ big earthquakes,’ says Richard Wal- 
ters, a geophysicist at Durham University, UK, 
who led the research. “The initial stress condi- 
tions are the same, but the cascading rupture 
of multiple segments takes place over days to 
weeks instead of over seconds? 


“AVALLARI/NURPHOTO. 


COMPARE AND CONTRAST 
To find out why, Walters and his colleagues took 
advantage of the wealth of satellite data from 
the Italian 2016 quakes. The satellites — part of 
Europes Sentinel Earth-observing constellation 
— provided images of the shape of the ground 
surface. Because the data were taken roughly 
every 1.5 days, the scientists were able to com- 
pare images from before and after each quake 
and calculate how the ground had moved. 

Combining these data with seismological and 
ground-based measurements, the team found 
that a network of smaller, cross-cutting faults 
underlies the Apennine region. The researchers 
say that these small faults act as barriers to the 
rupture process, preventing major faults from 
being ‘unzipped in one go. Had the faults all 
failed at once, the region would have experi- 
enced a single earthquake with a magnitude of 
about 6.7 — some 50% stronger than the largest 
individual quake that did strike. 

Studying the thousands of small aftershocks 


that followed the first quake, the team observed 
tiny quakes creeping northwards at a rate of 
around 100 metres a day — and found that this 
matched the speed at which naturally occurring 
underground fluids would be expected to move 
along fault lines. “The pattern of small after- 
shocks suggests that each subsequent quake is 
triggered by the increased pressure associated 
with fluids being pumped through the net- 
work of minor faults,’ says Walters. The second 
quake, two months later, occurred exactly when 
the aftershocks — and fluid, as predicted by the 
team’s models — reached the next major fault 
line. “The fluids are being driven by pressure 
changes. When they reach a fault, the increased 
pressure ‘unclamps the fault and allows it to 
move, says Walters. 


The Italian town of Amatrice was largely demolished in three earthquakes in 2016. 


Nicola D’Agostino, a geoscientist at 
the National Institute of Geophysics and 
Volcanology in Rome, finds the mechanism 
plausible. D'Agostino also agrees that it’s theo- 
retically possible to forecast the later quakes, 
but says it could be tricky to know when a 
quake is a one-off event and when it is the start 
of a sequence. 

Stephen Hicks, an earthquake scientist at 
the University of Southampton, UK, thinks 
that the findings will change how geoscientists 
work. “Normally, we don't try and interpret 
aftershocks until later, but I think this will spur 
scientists into analysing more-subtle features 
in real time,’ he says. “The challenge will be to 
monitor and interpret the data quickly enough 
to provide a meaningful forecast. m 


China’s academics await 
national journal blacklist 


But some researchers say the policy won’t succeed in improving research quality. 


BY DAVID CYRANOSKI 


proposal by the Chinese government 
Ae create its own blacklist of journals is 

creating much debate among the coun- 
try’s scientists, who are still waiting for the list 
to be revealed, five months after the plan was 
announced. 

Preparation of the list has been shrouded in 
secrecy. The government says it will include 
journals that it considers to be of poor qual- 
ity or those seeking excessive profit, but it has 
not released its selection criteria, nor has it said 


when the policy will take effect. 

A couple of commercial blacklists exist, and 
some Chinese institutions already have lists 
of journals that researchers should avoid, but 
lists run by government agencies are rare. The 
Chinese government hopes that a national 
policy will improve research integrity by 
reducing the number of low-quality or fraudu- 
lent articles from Chinese authors. Academics 
will receive warnings if they submit to the 
selected publications. 

But some researchers say that a national 
blacklist won't fix these problems and will 


be difficult to manage. Lists of approved 
publications are a better tool for improving 
research quality, they say. 

The science ministry was tasked with 
creating a blacklist in May, when the govern- 
ment announced a crackdown on scientific 
misconduct after numerous cases of fake peer 
reviews, plagiarism and the use of fraudulent 
data. At the time, the government said that 
the list would include domestic and inter- 
national scientific journals, and that publi- 
cations in these journals would no longer be 
counted towards a scientist’s applications > 
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> for promotion, jobs or grant funding. 

Nature has seen several lists compiled by 
Chinese institutions that already tell researchers 
to avoid certain publications. The Zhongshan 
Ophthalmic Center at Sun Yat-sen Univer- 
sity in Guangzhou circulated a document in 
January that warned its researchers against 
publishing in a list of journals that it labelled 
“controversial in the community” because they 
have had a lot of retractions. The Obstetrics and 
Gynecology Hospital of Fudan University in 
Shanghai has compiled another list. A repre- 
sentative of the hospital says that publications 
in the journals on its list are not forbidden, but 
that researchers cannot use grant money to pay 
the publication fees. 

The first list contains two of the world’s 
largest journals, PLoS ONE and Scientific 
Reports. Joerg Heber, editor-in-chief of 
PLoS ONE, says that he does not know why 
some Chinese universities discourage their 
researchers from submitting to the journal, 
because only a small fraction (119) of its 
almost 200,000 papers have been retracted. 

A spokesperson from Scientific Reports said 
that they were unable to comment on indi- 
vidual decisions, but hoped that institutions 
will continue to recognize the journal’s value. 
(Nature’s news team is editorially independ- 
ent of its publisher, Springer Nature, which 


also publishes Scientific Reports.) 

Omid Mahian, a thermal engineer at Xi'an 
Jiaotong University, thinks it would be better 
to have a national policy that applies to all 
researchers than for institutions to have their 
own lists. 

And Shi Xiaolei, a science historian at the 
Institute for the History of Natural Science 
in Beijing, part of the Chinese Academy of 

Sciences (CAS), 


“The real says that a national 
problem is blacklist and those 
not with the recently adopted at 
journals, but institutions “will have 
withthe person _a positive impact on 
whosubmitsthe China’s academic 
article.” environment”. 

Medical researcher 


Ren Chuanli says that a national blacklist 
might help to reduce scientific misconduct 
because it will punish some journals that 
publish low-quality manuscripts. “But the 
real problem is not with the journals, but with 
the person who submits the article,’ says Ren, 
who works at the North Jiangsu People’s Hos- 
pital. He says that some high-quality journals 
publish low-quality papers, and that some 
journals that are considered low quality pub- 
lish the occasional high-quality, highly cited 
article. Fraudulent papers, too, have appeared 


in journals of all qualities, so a blacklist based 
on overall journal quality won't necessarily 
stop those papers either, says neuroscien- 
tist Mu-ming Poo at the CAS Institute of 
Neuroscience in Shanghai. 

Blacklists are also difficult to maintain 
because new journals are always launching, 
says Lars Bjornshauge, the Cophenhagen- 
based managing director of the Directory 
of Open Access Journals. Bjornshauge also 
wonders whether China might use black- 
lists as a way of promoting Chinese journals 
over others. 

Yu Liping, who studies academic evalu- 
ation at Zhejiang Gongshang University in 
Hangzhou, doubts that the list will be com- 
prehensive and include all the problematic 
Chinese journals that have been linked to 
scientific misconduct. Yu says lists of approved 
journals that meet certain standards are a 
better tool than blacklists for improving 
research standards. 

Most researchers agree that what China 
needs is a more comprehensive system by 
which to evaluate research quality. “The 
important thing is whether those who evaluate 
research are actually evaluating the research 
and not only looking for papers in ‘inter- 
national’ journals on the researchers’ CV,” says 
Bjornshauge. = 


Hunting for HIV’s hideouts 


Scientists are testing new ways of mapping where the virus lurks in the body. 


BY SARA REARDON 


ntiretroviral drugs have transformed 
A™ infection from a death sentence 

to a chronic condition for many peo- 
ple who have the virus. But because HIV never 
truly leaves the body, the virus rebounds rap- 
idly if patients stop taking the drugs. 

Now, scientists are trying to work out how, 
and where, HIV hides when blood tests show 
that a person's viral load is low or undetect- 
able. The location of this reservoir has long 
been a mystery, but that could soon change. 
Powerful new techniques are giving research- 
ers an unprecedented look at how HIV travels 
though the bodies of people and other animals 
— turning up clues to the virus’s hiding places 
and targets for future therapies. 

HIV is a challenging foe because it inte- 
grates into the DNA of its host cells. Some 
scientists argue that a true cure would require 
the removal of all traces of the virus’s DNA 
from the body, rather than simply prevent- 
ing HIV from hijacking cells to replicate itself 
— and that goal might be unreachable. “We 


are starting to realize that getting rid of all the 
HIV DNA is not completely realistic,” says Sara 
Gianella, an infectious-disease researcher at 
the University of California, San Diego. 

The best that researchers can hope for may 
be to permanently silence or contain HIV 
after infection, says Gianella, who last week 
presented new findings on HIV’s hideouts at a 
meeting organized by the US National Institutes 
of Health (NIH). Although antiretroviral drug 
cocktails — known as ART — can suppress HIV 
in immune cells in the blood, the first results 
from Gianella’s “Last Gift” study confirm that 
the virus can stash itself in dozens of tissues. 

The project examines bodies donated by peo- 
ple with HIV who enrol when they are within 
six months of death from unrelated conditions. 
All participants are on ART when they sign up 
for the study, but some are asked to stop taking 
the drugs. Gianella’s team collects blood samples 
while donors are alive, and about 50 different 
types of tissue after death. The samples from 
people who stopped taking ART show where 
HIV has rebounded, whereas samples from 
people who continued on the drugs can provide 
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clues about the virus’s reservoir. 

The researchers did not detect HIV in the 
blood of their first donor, who continued tak- 
ing antiretroviral drugs until his death. But 
they did find viable virus in nearly all of the 
26 tissues they examined after the man died. 

Janice Clements, a pathobiologist at Johns 
Hopkins University in Baltimore, Maryland, 
calls the project “amazing”. After a person 
with HIV dies, she says, researchers can rarely 
acquire tissue samples quickly enough to 
measure the level of virus present, and they 
usually don’t know whether the dead person 
had been taking antiretroviral drugs. 

Clements’ research has shown that HIV 
tends to linger in the brain and cause neuro- 
logical problems, because most antiretroviral 
drugs can't cross the blood-brain barrier. At 
the meeting, Clements and her colleagues pre- 
sented the first evidence that simian immuno- 
deficiency virus (SIV), which is closely related 
to HIV, survives in the spinal cord of macaques 
taking antiretroviral drugs and spreads quickly 
after the animals stop taking the drugs. “This is 
unlike any virus we've cured,’ she says. 
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Nicolas Chomont, a virologist at the Uni- 
versity of Montreal in Canada, says that HIV’s 
behaviour in tissues that are known to be part of 
the reservoir is complex, with virus levels going 
up and down. “People will tell you the reservoir 
is everywhere, and that might be true,’ he says. 
“Even if it’s true, we need to understand if [the 


HIV invades immune cells called T cells and uses them to replicate itself. 


virus] in the brain and the big toe are the same? 

Tracking these patterns over time might 
require identifying traces of HIV in the reser- 
voirs of living people — a daunting task. “For 
these very rare events, you need to know where 
to look,” says Thomas Hope, a cell biologist at 
Northwestern University in Evanston, Illinois. 


IN FOCUS | NEWS 


At the NIH meeting, Hope presented a new 
imaging technique in macaques infected with 
SIV. The researchers inject the animals with 
antibodies that bind to the virus, which makes 
it visible in positron-emission tomography 
(PET) scans of the monkeys’ bodies. 

The approach has revealed that SIV spreads 
through mucosal cells in the animals’ guts and 
lymph nodes within hours of infection. Hope 
has begun to treat infected macaques with 
ART to determine where and how quickly 
the drugs lower their levels of SIV. After six 
months of treatment, the researchers plan to 
stop the treatment and scan the monkeys to 
see where the virus has rebounded. 

And later this year, another team will begin 
one of the first PET imaging studies of people 
with and without HIV in their blood, using 
a different antibody. Timothy Henrich, an 
infectious-disease researcher at the University 
of California, San Francisco, who is directing 
the study, says that his group hopes to measure 
what happens when people on ART stop tak- 
ing the drugs. Gianella’s team also wants to test 
PET imaging in Last Gift participants. 

Ultimately, working out where HIV hides 
will require researchers to do more than just 
measure the virus in blood, Hope says. Blood 
immune cells are “going to be a more minor 
player compared to the dark-matter reservoir’, 
he says. “We know it’s there” m 
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Researchers are ramping up plans for living on the Moon. 


BY ELIZABETH GIBNEY 


ext year, astronaut Matthias Maurer expects to walk on the 
surface of the Moon — but without the hassles of a rocket 
flight, zero-gravity nausea and a risky landing. Instead he'll 
stroll close to home in a leafy meadow near Cologne, Ger- 
many, which is set to host the largest Moon mock-up ever made. On 
a pit of artificial lunar dust covering more than 1,000 square metres, 
Maurer and other scientists will be attached to crane-and-pulley sys- 
tems that allow them to leap as if experiencing the Moon's weaker grav- 
ity, and work under adjustable lamps that simulate lighting at different 
lunar sites. Sometimes, they will retreat to lunar-style living quarters: an 
airlock-connected module the size of a shipping container. 

It’s an exciting playground for testing lunar technology, says Maurer, 
who is a project manager for the multimillion-euro facility. Called 
LUNA, the mock-up is taking shape outside the European Astronaut 
Centre in Cologne, with funding from the European Space Agency 
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(ESA) and the German Aerospace Center (DLR). But at 48 years old, 
Maurer doesn’t know whether he will ever put his skills to use on the 
genuine article. “Hopefully I will make it before retirement. Technically, 
I believe it’s feasible that I will still walk on the Moon; he says. 
Maurer’s optimism isn't entirely outlandish. He was only two the last 
time someone visited the Moon for real: US astronaut Eugene Cernan, 
in the last mission of NASA’s Apollo programme. No space agencies 
have yet committed money to send people back. But, partly as a result 
of changing political priorities, momentum for human return to the 
Moon is growing. Rather than rerun the Apollo missions, space agencies 
are slowly warming to the idea of establishing a sustainable settlement. 
Researchers relish the idea of a base for conducting experiments on 
the Moon and asa way to trial technologies for heading to Mars. Private 
firms, however, are increasingly tempted by the possibility of mining 
oxygen and hydrogen — which power rockets — from lunar ice. If that 
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does pan out, then the Moon could become a refuelling station, radi- 
cally reducing the expense of space travel. “Water is the oil of space, and 
there's mounting evidence that it’s there in economically viable deposits,” 
says George Sowers, an aeronautical scientist at the Colorado School of 
Mines in Golden and previously chief scientist at United Launch Alli- 
ance, a firm in Centennial, Colorado, that provides launch services for 
the US government. 

Space agencies are generally reluctant to predict a timeline for a 
crewed Moon base — in part, because the goal lies well outside their 
budget horizons, but also because it requires businesses to provide much 
of the money for the various stages involved. But ESAs director-general, 
Jan Worner, has for years talked about multiple countries and companies 
collaborating in a semi-permanent settlement, which he calls a ‘Moon 
village. China’s National Space Administration has also been quoted 
in state media saying that a Moon base is its goal, although it has not 
announced when that might happen. 

Lunar exploration prospects gained a boost last December, when a 
US presidential directive told NASA to switch its sights from explor- 
ing asteroids to returning humans to the Moon. NASA has since asked 
companies to develop lander technology and has outlined plans to 
request billions of dollars over the next five years for new programmes 
to support lunar exploration, leading to eventual human return. “It’s 
a fairly radical change of direction,” says Sowers. And this October, 
European engineering firm Airbus launched a contest called the Moon 
Race, with supporters that include ESA and the US spaceflight firm Blue 
Origin, which promises to fund companies to develop key technologies 
for the sustainable development of the Moon. 

In the next few years, the only missions to touch down on the Moon 
will be robotic: China and India will launch probes in the next three 
months, and Russia has one scheduled within the next five years. But 
NASA, ESA and the space agencies of Russia, Japan and Canada all sup- 
port the idea of building a space station in 
orbit around the Moon by the mid-2020s. 
(In his 2019 budget request, US Presi- 
dent Trump proposed that NASA spend 
US$2.7 billion on the project over the 
next 5 years.) That orbiter could provide 
a base from which to make short, multi- 
week crewed trips to the Moon's surface 
in a pressurized rover — which Maurer 
calls the “camper-van solution” Finally, a 
settlement could follow. “I think that 20 
years is a realistic time frame for a lunar 
surface infrastructure of some sort, either 
inhabited by or tended by humans,’ says 
James Carpenter, strategy officer for human and robotic exploration at 
ESA in Noordwijk, the Netherlands. 

Researchers have long explored ways to harvest lunar resources, but 
more in hope than expectation. Now, they are ramping up lunar settle- 
ment technology with genuine anticipation that their work might be 
put into action. While Maurer and others at ESA’s LUNA centre practise 
living on and mining the Moon, others are working on how to grow 
food and build radiation-proof shelters (see “Living on the Moon). Ata 
July meeting at ESAs European Space Research and Technology Centre 
in Noordwijk to prepare for future human Moon missions, more than 
250 specialists from academia, mining, metallurgy, construction and 
architecture pitched their ideas. “If you ran the same workshop five years 
ago, it may have only been a handful of people,” says Aidan Cowley, 
science adviser at the European Astronaut Centre. “The appetite has 
really increased.” Although a Moon base might still never happen, Earth- 
bound preparation for lunar living is well under way. 


WATER MINING 

Lunar settlers’ first challenge will be harvesting water. The Apollo 
missions, which collected samples from the Moon's equator, suggested 
that the satellite is dry and barren. So the discovery a decade ago that the 
Moon's poles harbour patches of water ice “was a game changer’, says 


“WE'D LIKE TO KNOW 
WHATIT TAKES T0 BUILD 
LIVING SOILS OUT OF 
WHAT IS ESSENTIALLY 
SPACE DUST.” 
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Robert Mueller, a senior technologist at the NASA Kennedy Space Center 
in Cape Canaveral, Florida, who develops lunar-mining technologies. 

For now, researchers don't exactly know where the ice is, how 
thick it is or whether it is mixed with soil or packed in sheets. India’s 
Chandrayaan-2 rover, scheduled to launch next year, and Russias LUNA 
27 lander, planned for 2022, will target these questions. Russia's lander 
will have a 2-metre-long drill designed by ESA, and a laboratory to study 
the origin and abundance of lunar water. NASA also wants to hunt for 
that water, and has commissioned a suite of companies to develop lunar 
landers that would carry prospecting instruments, beginning as early 
as next year. A four-crew human base would need a negligible amount 
of this water — perhaps dozens of tonnes per year, says Sowers — and 
there’s plenty of it. “Estimates based on current data suggest there may 
be 10 billion tonnes per pole,” he says. 

The vast majority of the ice would be mined for fuel. Sowers calculates 
that mining firms could turn a profit by extracting around 1,000 tonnes 
of water a year and electrolysing it into its constituent oxygen and hydro- 
gen parts for propellant. The Moon's low gravity means that it would be 
much cheaper to stock up for long-distance space travel from there than 
itis from Earth. As an example, a lunar return mission that refuelled at 
the Moon would cost just one-fiftieth of the price of one that brings all 
its fuel with it from Earth. 

In August, scientists using data from India’s Chandrayaan-1 orbiter 
found that the Moon's ice lies on its surface — but in permanently 
shaded craters as frigid as —-249 °C, the naturally coldest spots known 
in the Solar System. Excavation machines would need heat and power to 
release the water and turn it into propellant. Because plutonium-based 
batteries, which rely on the heat generated by the natural decay of radio- 
isotopes, are too expensive for most private firms, lunar prospectors will 
probably have to harness energy from the Sun. 

They will take inspiration from southern Norway, where giant mirrors 
set high on a mountain overlooking the 
town of Rjukan have since 2013 beamed 
sunlight onto a patch of the town’s central 
square that would otherwise be grey and 
chilly all winter. Prospectors would hope 
to do something similar on the Moon, 
says Sowers. Light from high peaks could 
be directly channelled into the craters, he 
says, where it would heat the ice and turn 
it into vapour. From there, condensed 
water would be shuttled to a processing 
plant and split by solar electricity into 
hydrogen and oxygen. These gases could 
then be stored and either used as propel- 
lant or channelled through fuel cells to supply energy. 

Alternatively, rovers could scoop up ice-filled soil and warm it in ovens 
to release water. The ovens could be powered wirelessly, by training high- 
power lasers onto photovoltaic cells on the rover. The LUNA facility could 
test how this would work in real life, with added challenges such as Moon 
dust in the atmosphere that scatters the laser beams, says ESA’s Leopold 
Summerer. LUNA scientists will also clamber into life-like Moon craters 
to see how easy it is to navigate these deep, dark slopes, Maurer says. 


LIVING OFF THE SOIL 

Ifice isn't accessible, there is an alternative source of water on the Moon: 
its soil, also known as regolith. Regolith contains silica and metallic 
oxides that make it — on average — 43% oxygen by mass, and it is 
found everywhere on the Moon. Oxygen grabbed from soil could power 
scientifically or economically interesting outposts far from the poles, 
and produce useful by-products such as rare metals. 

Regolith doesn't give up its riches easily. Releasing oxygen from its 
chemical bonds is more energy-intensive than heating ice. In theory, a 
reactor could use giant mirrors to channel sunlight onto a furnace little 
bigger than an envelope, heating Moon dirt to more than 900°C until it 
glows. At that temperature, hydrogen or carbon, brought initially from 
Earth, can strip oxygen from its minerals and bind the element together 
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LUNAR ORBITAL 
PLATFORM 


Humanity’s next international 
outpost in space might be a 
lunar orbiter, after the 
International Space Station is 
retired, probably in the 
mid-2020s. Supported by 
NASA, ESA, JAXA and others, 
this hub could launch rovers 
to the Moon and act as a 
staging post for humans 
travelling to the lunar surface. 
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Prototype rovers are doing test runs in Spain's Canary Islands. 


with hydrogen to make water. A field test in Hawaii in 2010 on simulated 
lunar regolith demonstrated that the process was feasible — although 
working in low gravity and a vacuum is untested. “Essentially it’s a proven 
technology and would be ready to go within a few years,’ says Mueller. 

Researchers hope to improve the process further to cut down on what 
needs to be brought from Earth. At the Polytechnic of Milan in Italy, a 
group led by aerospace engineer Michéle Lavagna is developing a pro- 
totype that works at lower temperatures and recycles all the inputs — in 
this case, methane and hydrogen — so that soil is the only consumable. 
Currently, one device might take decades to generate enough water to 
power a single Apollo-style lander back into orbit. But Lavagna says that 
on the Moon, multiple reactors could work in parallel. With around 
€600,000 (US$692,000) of ESA funding, her team is now working on 
a demonstrator plant that would be small enough to fly on a mission. 


FINDING SHELTER 
If it turns out that water can’t be harvested for profit, an outpost 
based around scientific experiments will still develop, argues Maurer. 
“Without a commercial perspective, it simply will take much longer to 
materialize,” he says. “We might end up ina situation similar to research 
in Antarctica — driven mainly by pure scientific interest.” 

Researchers are excited about the experiments that a return to the 
Moon could yield, says Robin Canup, a planetary scientist at the South- 
west Research Institute in Boulder, Colorado. Sampling the Moon’s 
ancient craters could reveal how the Moon-Earth system formed, when 
the early Solar System was in a state of flux and asteroids pummelled the 
satellite. Researchers would also like to study the Moon's water cycle and 
its seismology — and to install a radio-telescope shielded from Earth 
interference, which could study radiation from the early Universe. 

Unlike in Antarctica, however, lunar residents need to be sheltered from 
the charged particles of radiation and tiny meteorites that rain down from 
space — because the Moon has next to no protective atmosphere or mag- 
netic field. The first flat-pack shelters are likely to be brought from Earth, 
but would need to be covered with metres of sand or regolith, Maurer 
says. One solution is natural: to exploit cliffs, canyons, caves and lava 
tubes — tunnels caused by ancient volcanic activity — to protect living 
quarters. Last year, scientists reanalysing radar data from the Japanese 
space agency’s SELENE orbiter and density measurements from NASAs 
GRAIL mission found a candidate tunnel that apparently runs for kilo- 
metres, beneath the Marius Hills on the Moon’s near side. On Earth, 
researchers have already practised commanding rovers to drive in lava 
tubes in Lanzarote, in Spain’s Canary Islands, to prepare for future lunar 
exploration. 

A few hundred metres from LUNA, ina DLR laboratory in Cologne, 
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Matthias Sperl is testing another idea — growing artificial 
stone from regolith. In Sperl’s lab, a beam of intense light, 
concentrated on a coin-sized patch, fuses together fizzing 
sheets of powder at a searing 1,100°C. Over time, these layers 
add up to create dark grey, grainy bricks, as in a 3D printing 
process. On the Moon, sunlight could be concentrated to do 
the same thing, says Sperl, whose project is part of RegoLight, 
a €1-million collaboration between the DLR, Belgian aero- 
space company SAS and architecture and engineering firms. 
The sheets do not bond together perfectly, but the bricks are 
already around one-fifth as strong as concrete and compara- 
ble to plaster, says Sperl. Architecture firms Bollinger Grohm- 
ann Schneider and Liquifer Systems Group, both in Vienna, 
showed in April that by interlocking the bricks into arches 
and domes, they could create robust structures. Sperl says 
these are solid enough to survive Moon quakes and bear the 
weight of more gravel piled on for protection. Currently, the 
process takes around 5 hours to make a single brick — but 
channelling more sunlight could speed it up, he says. Sci- 
entists elsewhere are exploring making shelters by searing 
regolith together in microwave ovens, or binding it together 
with materials brought from Earth, such as polymers. 


KALE DIET 

Plant scientists have also spent a good deal of time thinking about the 
final ingredient needed ina self-supporting Moon base: food. As part of 
a closed ecosystem, plants would recycle organic waste and turn carbon 
dioxide into oxygen to breathe. In May, Chinese state media reported 
that volunteers finished a record 370-day stay inside such an ecosystem, 
a simulated base known as Lunar Palace 1, in which they grew crops and 
raised mealworms for protein. 

Astronauts on the International Space Station (ISS) already eat space- 
grown lettuce and other leafy greens. A NASA programme run from 
the Kennedy Space Center, known as Veggie, has helped to select crops 
that grow well in confined spaces and are packed with the nutrients 
that degrade most in storage — vitamins Cl, K and potassium. Kale is 
a winner. “It’s a powerhouse that hits everything,” says Veggie project 
manager Trent Smith. 

On the Moon, astronauts would grow plants in water under white 
and red LEDs which they can tweak to alter the mineral and vitamin 
composition of the plant. Next year, ISS tests will examine how the com- 
position of tomatoes changes depending on the light. More studies will 
be needed to establish how best to grow crops in regolith’s mixture of 
metals. “Wed like to know what it takes to build living soils out of what 
is essentially space dust,’ says Smith. If plants can grow in regolith, adds 
Veggie researcher Matthew Romeyn, “suddenly that means bringing 
small fruit trees, not just leafy greens.” 

“Tf humans can only stay for short periods of time because of hazards, 
and if they can't grow food locally, the programme will implode,’ Muel- 
ler says. Another barrier might be a legal one: the 1967 Outer Space 
Treaty, which all major space-faring nations have signed, states that 
no country can “appropriate” any part of a celestial body. Most nations 
today accept that this does not rule out mining in space, says Dimitra 
Stefoudi, an expert in space law at the University of Leiden in the Neth- 
erlands. Since 2015, two countries, the United States and Luxembourg, 
have enacted national laws allowing space mining, to promote nascent 
companies. (Russia and Belgium are among countries which argue that 
mining needs a new international framework.) But the 1967 treaty also 
says that space activities should be for the benefit of all countries and 
humankind, so firms will still need to find ways to share know-how and 
the eventual gains of harvesting resources on the Moon, Stefoudi says. 

Ultimately, says Mueller, setting up camp on the Moon is likely to be a 
desire checked not by technology, but by political will and economics. “If 
we can solve both of those, I absolutely believe that permanent inhabita- 
tion of the Moon will happen.” = 


Elizabeth Gibney is a reporter for Nature in London. 
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CITIZEN SCIENCE 


COMES 


OF AGE 


Efforts to engage the public in research are bigger and more 
diverse than ever. But how much more room is there to grow? 


BY AISLING IRWIN 


ilip Meysman knew he had made his mark on Antwerp when 

he overheard commuters discussing his research project on 

the train. Then, just a few days later, he saw an advertisement 

about his work on television. There it was, he says, “in between 
the toothpaste and George Clooney’s Nespresso’. 

As a biogeochemist at the University of Antwerp in Belgium, 
Meysman wasn't used to drawing so much attention. But that was before 
he adopted the citizens of northern Belgium as research partners. With 
the help of the Flemish environmental protection agency and a regional 
newspaper, Meysman and a team of non-academics attracted more than 
50,000 people to CurieuzeNeuzen, an effort to assess the region’s air 
quality (the name is a play on Antwerp dialect for ‘nosy’ people). 

The project ultimately distributed air-pollution samplers to 20,000 
participants, who took readings for a month (see ‘Street science’). More 
than 99% of the sensors were returned to Meysman’s laboratory for 
analysis, yielding a bounty of 17,800 data points. They provided Mey- 
sman and his colleagues with information about nitrogen dioxide con- 
centrations at ‘nose height’ — a level of the atmosphere that can’t be 
discerned by satellite and would be prohibitively expensive for scientists 
to measure on their own. “Tt has given us a data set which it is not pos- 
sible to get by other means,’ says Meysman, who models air quality. 
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Citizen science — active public involve- 
ment in scientific research — is growing bigger, 
more ambitious and more networked. Beyond 
monitoring pollution and snapping millions of 
pictures of flora and fauna, people are building 
Geiger counters to assess radiation levels, pho- 
tographing stagnant water to help document 
the spread of mosquito-borne disease, and tak- 
ing videos of water flow to calibrate flood models. And an increasing 
number are donating thinking time to help speed up meta-analyses or 
assess images in ways that algorithms cannot yet match. 

The movement is surfing wider societal forces, including a thirst for 
data; the rise of connectedness and low-cost sensor technologies; and a 
push to improve the transparency and accessibility of science. Increas- 
ingly, government institutions and international organizations are get- 
ting in on the action. The US and Scottish environmental protection 
agencies, for example, have incorporated citizen science in their routine 
work. The United Nations Environment Programme is exploring ways 
of using citizen science to both monitor the environment and stoke envi- 
ronmental concern. And the European Commission has made a range of 
funding opportunities available for citizen science within its €80-billion 


Japanese priest 
Sadamaru Okano 
stands beneath 

a Geiger counter 
(top left) that sends 
radiation readings to 
the Safecast project. 
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(US$92-billion) Horizon 2020 research and innovation programme. 

At the same time, citizen-science proponents have grand visions for 
the future of the field. They hope that such efforts will become a major 
source of high-quality data and analysis in areas relevant to policy- 
makers as well as scientists. In December, multiple citizen-science 
organizations banded together to form a worldwide group — the Citizen 
Science Global Partnership. One of its first tasks is to explore how citi- 
zen science can help to monitor progress towards the UN’s Sustainable 
Development Goals, which aim to address global challenges ranging 
from hunger to environmental degradation by 2030. 

To gain legitimacy, many expect that the field will have to overcome 
lingering concerns about the reliability of its measurements and its use- 
fulness in research. “There needs to be some type of acceptance and 
institutionalization of citizen science,’ says Steffen Fritz, a specialist in 
Earth observation and citizen science at the International Institute for 
Applied Systems Analysis (IIASA) in Laxenburg, Austria. “It needs to 
be not just bottom-up — it needs also to be accepted as some kind of 
official data stream” 


COUNTERS AND ENCOUNTERS 

The origins of citizen science go back at least a couple of millennia. In 
ancient China, migratory locusts frequently destroyed harvests, and 
residents have helped to track outbreaks for some 2,000 years. The mod- 
ern form of such research arose after science became a professional 
activity, creating a cohort of interested outsiders in the process. The 
phrase ‘citizen science itself was coined in the mid-1990s. Alan Irwin, 
a sociologist now based at the Copenhagen Business School, defined it 
both as “science which assists the needs and concerns of citizens” and 
as “a form of science developed and enacted by the citizens themselves”. 

Some of the earliest modern citizen-science projects, starting with 
bird counts in the early twentieth century, involved concentrated 
outdoor campaigns to record animal sightings. Since then, public 
involvement has grown to encompass a range of roles. Muki Haklay, 
a geographer at University College London, has outlined a taxonomy 
of involvement, from ‘crowdsourced citizen science, in which lay peo- 
ple contribute data or volunteer computing power, to ‘co-created’ and 
‘collegial’ research, in which members of the public actively engage in 
most aspects of a project, or even conduct research on their own. 

In areas such as biodiversity, where citizen science first thrived, 
projects are breaking boundaries through the sheer volume of partici- 
pants and data. The Global Biodiversity Information Facility, the world’s 
largest such repository, says that it gets half of its billions of data points 
from lay sources. The group estimates that it has supplied data for more 
than 2,500 peer-reviewed papers in the past ten years. 

At iNaturalist, a social network to which anyone can submit a photo- 
graph of their encounters with flora and fauna, co-director Scott Loarie 
has presided over a doubling of submitted images every year since it was 
launched in 2008. He tries to trace scientists’ use of iNaturalist data and 
has counted 150 papers so far — but he thinks that the actual number 
is much higher because many of the papers don't cite the organization. 

Other researchers have enlisted the public in more-involved projects 
to enhance research activities, including checking data derived from 
other sources. When a team published a paper’ in 2011 suggesting that 
there could be enough marginal land to grow biofuel sufficient to meet 
half the world’s liquid-fuel needs, Fritz recruited an army of citizen 
analysers to participate in the IIASA’s Geo- Wiki project to study the 
claim. After working through thousands of images from Google Earth, 
they generated estimates of land use that were hundreds of millions of 
hectares lower than those of the original paper’. “We downgraded the 
initial estimates drastically,” says Fritz. 

Fritz thinks that some people are attracted to his projects because they 
want to contribute to science, whereas those who become most involved 
are drawn to the prospect of co-authorship on papers. Some simply like 
the offer of Amazon vouchers, he says, or a few euros. 

Other projects can draw participants for political and social reasons. 
Within days of Japan’s Fukushima Daiichi nuclear disaster in 2011, a 
small group mobilized to distribute Geiger counters (and ultimately 
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STREET SCIENCE 


In May, a collaboration that included the 
Flanders Environment Agency in Belgium 
ran a month-long citizen-science 
campaign to help test a computer model 
of air quality in the region. 


Participants installed nitrogen dioxide 
samplers on first-floor, street-facing 
windows inside V-shaped signs to 
create a standard measurement set-up. 


All told, some 20,000 people participated in project across the 
Flanders region. Each paid €10 (US$11.5) to join the experiment. 


NETHERLANDS 


2 


BELGIUM 


The results are still 
being formally assessed. 
But early analysis has 
revealed — among 
other things — that the 
ability of some building 
arrangements to 
concentrate traffic 
exhaust in “street 
canyons” had been 
underestimated. 


NO, concentration 
Low 


DIY assembly kits) to anyone who wanted to measure radiation levels 
themselves. At times, local and central governments were hostile to the 
effort, says Azby Brown, an architect and a leader of the group, now 
called Safecast. But the findings proved useful, exposing inaccuracies 
in government readings: high counts where people had been told it 
was safe to go, and low counts in places that had been deemed unsafe. 
There is still scepticism about these citizen- generated data, Brown says, 
although the International Atomic Energy Agency has invited him to 
speak at several meetings over the past few years. 

But it’s not just lay people with concerns or scientists with a bright 
idea who trigger projects: governments and their funding arms are also 
getting involved. With the support of the European Commission, for 
example, a project called Ground Truth 2.0 has set up six pilot ‘citizen 
observatories’ in Africa and Europe. Each is designed to encourage a 
three-way conversation between laypeople, scientists (or those who 
process the data) and those who could benefit from the data, such as 
policymakers or local authorities. Ground Truth 2.0’s leader, Uta Wehn, 
a researcher at the IHE Delft Institute for Water Education in the Neth- 
erlands, says that earlier citizen observatories funded by the European 
Union included the public as an afterthought. But here, scientists don't 
dictate the project; they choose the location and let interest groups 
decide what issue they want to explore and how to do it. “We're putting 
the people before the sensors,’ she says. 

One observatory, which is examining deteriorating water quality in 
the Malaren region of Sweden, found out through early discussions that 
the existing data on water quality are dispersed, and that local people 
who do the monitoring had no connection with the decision-makers. 
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Two years in, Wehn says it is too early to say whether such projects are 
changing policy. But participants laud the relationships that have been 
built between various stakeholders, she says. 

Some research leaders are looking to citizen science to foster more 
inquisitiveness in the ‘post-truth’ era, in which emotional appeals 
often seem to win out against fact-based arguments. Francois Taddei, 
co-founder of the Center for Research and Interdisciplinarity in Paris, 
thinks that citizen science can revive critical thinking. Children exposed 
to such projects are “much less prone to fake news and all these problems 
that we are facing in the information age’, he says. 


GROWING PAINS 

Yet, even as its aspirations become grander in scale, citizen science faces 
a number of challenges, including data quality and recruitment — in 
terms of both persuading more scientists to work on such projects and 
enlisting enough citizens to participate in them. 

Papers published in the past few years have identified flaws in citi- 
zen-sourced data, including deviations from standard protocols and 
biases in recording or in the choice of sampling sites**, Graham Smith, 
a wildlife ecologist who analyses sightings made by members of the 
public for the London-based Mammal Society, a British conservation 
charity, says that Sunday ramblers will ignore yet another rabbit bound- 
ing across their path but unfailingly note a more spectacular sighting 
such as an otter, which is “the most recorded mammal in Britain for its 
population size”. 

Smith, who works for the UK Department for Environment, Food and 
Rural Affairs, has explored statistical approaches to combat this bias. New 
apps that tracka citizen’s route and time in the field are also enriching the 
data, he says. Meanwhile, simple techniques exist for testing the quality 
of online analysis, says Fritz. His group inserts occasional control submis- 
sions that test a contributor’s conclusion against a predetermined profes- 
sional one (those who regularly fail — about 5%, estimates Fritz — are 
dropped, whereas those who do well can progress to become co-authors 
of papers). Scent, a project that uses a gaming app to encourage citizens to 
photograph land use, has humans and algorithms check one another for 
errors, says Daniele Miorandi, a communications engineer for the project. 

Some academics fear that the public is getting fatigued by all the 
options, and note that participation in some projects, such as the United 
Kingdom's long-running Big Garden Birdwatch project, has declined. 
In an unpublished paper, Haklay has estimated that the number of peo- 
ple globally who could be drawn into regular data collection is about 
1.7 million. “You can get a lot of people for a short time investment, or 
very few people for a deep and intensive engagement, but you can’t get 
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Youth-programme participants Donovan Wooten and Maya Sanders record observations with iNaturalist. 


everyone doing it all the time,” he says. 

Researchers and participants are also 
encountering challenges with ethics, data 
use and privacy. In Kenya, for example, one 
of Wehn’s citizen observatories is a mapping 
project that enables people to note poaching 
incidents, wildlife encounters and fencing, 
which can be harmful to animals. But the data 
gathered could be used for nefarious purposes. 
“Sightings by the tourists might be perfect for 
the poachers,’ says Wehn. She says the team 
is in careful discussion with authorities about 
what data can be disclosed. 

These issues are likely to grow, particularly 
with the rise of health-monitoring apps. Philip 
Mirowski, a historian at the University of Notre 
Dame in Indiana, has raised concerns about the 
fate of citizen data. He points to projects, such as 
PatientsLikeMe, that ask people to upload medi- 
cal information. At least in the United States, he 
says, “the people who generate the data really 
don't have any say in what's done with it” 

Meanwhile, leaders in the field are pushing 
for more professionalization, by attempting 
to systematize the available research and agree on common method- 
ologies. The Open Geospatial Consortium, an international alliance of 
businesses, research institutes and government groups, has launched 
a taskforce to get citizen data streams to talk to one another. And the 
US-based organization SciStarter, an affiliate of Arizona State University 
in Tempe, has made tools and other resources available for avoiding 
pitfalls in rolling out projects. 

Some are sceptical of efforts to manage citizen science from the top 
down. Michiel van Oudheusden, a sociologist at the Catholic Univer- 
sity of Leuven in Belgium who has studied the example of Fukushima 
Daiichi, says that citizen science can be especially valuable when it 
is unaligned with the establishment. “Subversiveness can be very 
productive,” van Oudheusden says. 

But Martin Brocklehurst, an environmental consultant and citizen- 
science advocate, believes that the benefits of bringing order to the field 
outweigh those of being an outsider. “Too much of citizen science is like 
a fireworks display: it’s great science, but it’s short-lived,” Brocklehurst 
says. “We need to start embedding it into the routine way that we do 
science to support the policy-making process.” 

Perhaps that is what CurieuzeNeuzen has achieved. The group thinks 
itreached a world record in the density of air-quality measurements. Now 
the people of Flanders are mulling over the findings. Among other things, 
the results revealed that the centres of rural villages, which were thought 
to have pure air, in fact have high levels of traffic-related air pollution. 

The project has opened political doors that more-subdued announce- 
ments by the scientific community might never have done. Air qual- 
ity became a theme in local Flemish elections, which were held in 
mid-October. Meysman says that he has received many invitations to 
present his data. And the European Environment Agency says that it 
aims to apply the approach more widely. 

Still, Meysman says, citizen science isn’t always feasible. Less- 
established scientists, under pressure to publish, could not afford the 
time he has devoted to the CurieuzeNeuzen project, he says. Person- 
ally, he has loved watching the effort unfold — the communications 
campaign, the wave of public interest, the valuable new data — and the 
chance to put the results to practical and political use. “If I had collected 
the data myself, I would have had much less impact.” = 
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Aisling Irwin is a freelance journalist in Oxfordshire, UK. 
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Do authors comply with 
mandates for open access? 


The first large-scale analysis of compliance with open-access rules reveals that rates 
vary greatly by funder, report Vincent Lariviere and Cassidy R. Sugimoto. 


collectively called for research publica- 
tions to be made free, fully and immedi- 
ately; so far, 14 funders have signed up. Before 
that, at least 50 funders and 700 research insti- 
tutions worldwide had already mandated 
some form of open access for the work they 
support. Federally funded agencies and insti- 
tutions argue that taxpayers should be able 
to read publicly funded research, and that 
broader accessibility will allow researchers 
whose institutions do not subscribe to a par- 
ticular journal to build on existing research. 
However, few empirical analyses have 
examined whether work supported by fund- 
ing agencies with such mandates actually 


| ast month, European research funders 


is open access’ *. Here, we report the first 
large-scale analysis of compliance, focusing 
on 12 selected funding agencies. Bibliometric 
data are fraught with idiosyncrasies (see 
‘Analysis methods’), but the trends are clear. 
Of the more than 1.3 million papers we 
identified as subject to the selected funders’ 
open-access mandates, we found that some 
two-thirds were indeed freely available to read. 
Rates varied greatly, from around 90% for 
work funded by the US National Institutes of 
Health (NIH) and UK biomedical funder the 
Wellcome Trust, to 23% for work supported by 
the Social Sciences and Humanities Research 
Council of Canada (see ‘Mandates matter’). 
Our findings have policy implications. 


They highlight the importance to open access 
of enforcement, timeliness and infrastructure. 
And they underline the need to establish sus- 
tainable and equitable systems as the financial 
burdens for science publishing shift from 
research libraries to authors’ research funds. 


FREE FOR ALL 

Funders with open-access mandates have 
varying incentives, opt-out mechanisms, 
copyright protections, deposit guidelines 
and other associated infrastructures and 
requirements. These affect when, how and 
how much work is made open. Our analysis 
did not assess licensing and instead counted 
articles found to be freely available to > 
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MANDATES MATTER 


About two-thirds of papers under open-access (OA) mandates are free to read*, either from repositories (green OA) or journal websites (gold OA), with US funders 


favouring repositories. Of open papers, about half are available by both routes. 


VERY VARIED ACCESS 


Rates of compliance vary greatly by funder, although they mostly trend 
upwards. Dips in 2017 are due to embargoes (which delay access for 
fixed periods after publication). 


--—- Before mandate adoption 
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MRC, Medical Research Council (UK); BBSRC, Biotechnology and Biological Sciences Research Council 
(UK); EPSRC, Engineering and Physical Sciences Research Council (UK); ESRC, Economic and Social 
Research Council (UK); ERC, European Research Council; NIH, US National Institutes of Health; Gates, 
Bill & Melinda Gates Foundation; NSF, US National Science Foundation; CIHR, Canadian Institutes of 
Health Research; NSERC, Natural Sciences and Engineering Research Council (Canada); SSHRC, Social 
Sciences and Humanities Research Council (Canada). 
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GREEN AND GOLD 

Since 2009, the proportion of papers available to read* as both gold and green 
has soared, even as the proportion of green-only access has stayed relatively 
constant, and gold-only has dropped. 


™Gold only MGreen and gold M|Green only MNo open access identified 


19% in 2016 16% 21% 12% 
(69% in 2009) (5%) (51%) (10%) 
EPSRC ESRC 
United United 
Kingdom Kingdom 
25% 40% 19% 48% 
(20%) (6%) (22%) (17%) 

4% 8% 3% 


(18%) (21%) (11%) 


Wellcome 
Trust 
United 
Kingdom 


MRC 


United 
Kingdom 


76% 9% 80% 
(9%) (40%) (11%) (57%) 
53% 6% 12% 2% 
(63%) (8%) (10%) (3%) 


NSF 17% NIH 
United (17%) United 
States States 
24% 46% 40% 
(22%) (42%) (45%) 
17% 9% 27% 7% 
(27%) (13%) (54%) (8%) 
Gates 
Multicountry 
11% 63% 29% 37% 
(17%) (43%) (19%) (19%) 
79% 7% 45% 12% 
(75%) (7%) (46%) (23%) 


7% 
(7%) 


CIHR 


Canada 


7% 
(11%) 


2% 34% 
(7%) (24%) 


*Our analysis counted papers freely available to read on publishers’ websites as gold and those in 
repositories as green. It did not consider conditions of reuse or whether free access happened at the 
same time as publication. Shown over time in Supplementary Information Figure S3. 
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> read. We assessed whether access was 
gold (available on a journal website) or green 
(available in a repository, such as PubMed 
Central, the preprint server arXiv or else- 
where, sometimes with a delay or ‘embargo’ 
of up to a year after publication). About half 
(47.5%) of open papers are both green and 
gold (see “Green and gold’). 

Both the NIH and the Wellcome Trust state 
that they will withhold or suspend payments 
if articles are not made open access, although 
itis unclear whether they have done so. These 
agencies provide convenient repositories for 
depositing articles: PubMedCentral for NIH- 
funded work, and Europe PubMed Central in 
the case of Wellcome. Their policies encour- 
age compliance and allow authors to publish 
in journals that do not permit articles to be 
available immediately without a subscription. 
Although articles must be in a repository at 
the time of publication, free access can occur 
later. For example, a paper with a 12-month 
embargo published in the March 2016 issue 
of a journal would become freely available in 
the repository in March 2017. 

Funders that allow authors to deposit 
papers after publication see lower rates of 
compliance, presumably because authors 
lose track of this obligation. The Canadian 
Institutes of Health Research (CIHR) man- 
dated deposit on publication from 2008 to 
2015, but dropped this requirement when 
the three main Canadian research councils 
adopted a joint, harmonized policy. Com- 
pliance for CIHR-funded studies went from 
around 60% in 2014 to around 40% in 2017. 

Other funders that have lower rates of 
compliance than the NIH and the Wellcome 
Trust provide less enforcement and infra- 
structure. For example, the US National 
Science Foundation (NSF) called for “volun- 
tary compliance” with open-access mandates 
until early 2016 — its rate was around 47% in 
2016. Its repository, which uses infrastruc- 
ture developed by the US Department of 
Energy, has less visibility and fewer articles 
than PubMedCentral does. However, that 
might soon change, because deposition in 
this public repository is now mandatory for 
papers arising from NSF funding awarded 
after January 2016. Compliance at the CIHR 
is hampered by similar barriers. Unlike 
PubMed Central in the United States, Pub- 
Med Central Canada was never the dedicated 
infrastructure for Canadian medical papers. 
The Canadian repository faltered and then 
closed in February this year, and no strong 
environment of enforcement has arisen. 
Factors include lower funding in Canada 
compared to the United States, which makes 
it harder for authors to allocate funds for 
article-processing fees. 

The United Kingdom has seen a steep 
rise in open-access compliance across all 
agencies (see ‘Very varied access’). Rates at 
all four of the UK research councils stud- 
ied went up by at least 20 percentage points 


COMMENT 


ANALYSIS METHODS 


How we mined data on open-access compliance 


We first identified the funding sources 
of papers using the published 
acknowledgements (mandated by most 
funders). These have been indexed by 
the Web of Science (WoS) since 2008 for 
science and medicine, and since 2015 for 
social-sciences articles. There is no uniform 
format, so we looked for variations of 
agency names (such as ‘NSF’ and ‘National 
Science Foundation’) and aggregated these. 
Next, we used Unpaywall, a platform 
that helps researchers to find open-access 
articles. It identifies the population of 
scholarly papers using the list of unique 
digital object identifiers (DOls) registered by 
Crossref, a non-profit indexing organization. 
Unpaywall mines all journal websites listed in 
the Directory of Open Access Journals, along 
with databases such as PubMed Central 
and 50,000 other journal websites and 
repositories. It intentionally excludes papers 
that are available on social-networking sites 
(such as ResearchGate) or illegally (such 


between 2009 and 2016; the Engineering and 
Physical Sciences Research Council went 
up by 50 percentage points. This follows the 
publication of the Finch report in 2012 (see 
go.nature.com/2yojrkc) by a working group 
of academics, funders and publishers that 
was established in 2011 by David Willetts, 
then the UK science minister. It strongly 
recommended increasing access to research 
through article-processing charges and gold 
open access rather than by archiving papers 
in repositories. For the next assessment of 
research institutions in England in 2021, 
major UK funders have now decided to con- 
sider only open-access publications. 


FIELD CULTURE 

We find variations by discipline, with nearly 
full compliance in biomedicine, clinical medi- 
cine and health research. The social sciences, 
chemistry and engineering all show lower 
rates (see ‘Funder effect’). Within the same 
discipline, compliance varies drastically by 
funding agency. For example, in chemistry 
research, 81% of work funded by the NIH 
is publicly available, whereas that is true of 
only around one-quarter of chemistry stud- 
ies supported by the NSF and CIHR. Differ- 
ent funders support different types of work, 
but the variations we found also remain con- 
sistent within sub-disciplines (see Supple- 
mentary Information, Figure S5). Although 
researchers cite norms and needs within dis- 
ciplines as a reason not to comply with open- 
access mandates, we believe that the funding 
agency is a stronger driver of open access than 
is the culture of any particular discipline. 


as on Sci-Hub). As of April 2018, Unpaywall 
provided the open-access status of nearly 
96 million scholarly documents. 

Of the 12,495,074 journal articles we 
matched with Unpaywall using DOls, 
1,352,918 acknowledged funding from 
1 of the 12 funders we identified. 

To determine rates of compliance, we 
matched Unpaywall data to our set of WoS 
articles and analysed them by funder and 
discipline. WoS includes papers published 
in about 12,500 journals annually, so some 
funded work is in journals not covered 
by our analysis, especially in the social 
sciences and humanities. Our ability to 
assign funders to papers Is imperfect, 
given the various ways in which funder 
names appear and because authors do not 
always provide funding information. Rates 
of estimated compliance are likely to be 
conservative; there might be funded papers 
that are freely available online but which 
could not be found by Unpaywall. V.L. & 6.8.5. 


NEXT STEPS 
If funding agencies have their own data on 
compliance, the information should be 
openly published so that it can be used in 
assessments of the march of open access, such 
as ours. That would also allow comparisons 
to be drawn. Future research on compliance 
with open-access mandates should evalu- 
ate the utility of other data sources, such as 
Scopus, lfindr, Kopernio and Dimensions 
(run by Digital Science, a firm operated by 
the Holtzbrinck Publishing Group, which 
has a share in Nature’s publisher. Nature is 
editorially independent of its publisher). We 
must also create stronger reporting systems so 
that these data are more readily available for 
analysis. This involves collaboration between 
funders, publishers and indexers. Reporting 
should allow for analyses at the level of funded 
projects, which would provide information 
on the time between funding and open dis- 
semination. On a broader level, more research 
is needed to understand what makes scientists 
comply with funder mandates and why. 
Ultimately, open access needs a sustain- 
able financing model. Libraries and other 
organizations have historically borne the 
cost of publishing through subscription 
fees. Gold open access displaces those costs 
on to authors (who often need to allocate 
funds from their research budgets to cover 
publishing), even as libraries continue to 
shell out for subscription fees. The cost of 
publishing in open-access journals ranges 
from less than US$100 to more than $5,000 
per article, with dominant publishers such 
as Elsevier averaging $2,612 per paperin > 
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article-processing charges and Springer 
Nature (which publishes Nature) averaging 
$1,913 (see go.nature.com/2cn3zuy). The 
system as a whole risks charging multiple 
actors for the same product, and could price 
some places and people out of publishing. 

Advocacy must be balanced with evidence 
in the open-access debate. Our research 
demonstrates that funders can clearly shape 
compliance through their mandates, and 
that this compliance needs to be monitored. 
Real barriers — such as infrastructure and 
funding — must be overcome to make 
mandates efficient. However, the rhetoric 
surrounding disciplinary barriers might 
be more a myth than a reality: when the 
proper structure and incentives are in place, 
researchers comply. 


To move the conversation forward, we 
need a greater sense of the implications 
of open access on the scientific system’s 
financial structure. We must study how 
certain publishing models will put pres- 
sure on some parts of the system while 
alleviating it from other areas, or even 
enriching them. We need to ensure that 
the mandates are sensitive to financial 
inequity across countries, disciplines, 
institutions and researchers. 

Universities, industry and funding 
agencies should think collectively about 
robust and scalable models. Cooperation 
and foresight are the only ways to ensure that 
everyone has open access to research — both 
for readers who want to consume it, and for 
authors who wish to publish it. m 


Vincent Lariviere is associate professor at the 
University of Montreal and associate scientific 
director of the Observatory of Science and 
Technology, Montreal, Canada. Cassidy R. 
Sugimoto is associate professor of informatics 
at Indiana University Bloomington, USA. 
e-mail: sugimoto@indiana.edu 
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A gene-edited ‘micropig’ was developed in 2015 by the BGI genomics institute in Shenzhen, China. 


Use the patent system to 
regulate gene editing 


Governments should use patents to shape the deployment of CRISPR-Cas9 as they 
have done for past technologies, argues Shobita Parthasarathy. 


} | ext month, researchers, policy- 
makers, ethicists and social 
scientists will meet in Hong Kong 

for the second International Summit on 


Human Gene Editing. 
Since the first summit, held in 


Washington DC nearly three years ago, 
researchers have continued to apply 
the versatile gene-editing technology 
CRISPR-Cas9 to diverse domains — from 
crop enhancement and pest eradication 
to human disease. Many have flagged the 
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ethical, economic and environmental 
concerns raised by manipulating plant and 
animal genomes, including our own. But, so 
far, governments have struggled to develop 
viable approaches to regulation. 

A crucial part of the arsenal for shaping 


IMAGINECHINA/REX/SHUTTERSTOCK 


SOURCES: IPSTUDIES, SWITZERLAND; FAMPAT-QUESTEL 


the future of gene editing is hiding in plain 
sight: the patent system. 

In the past, patents have played an impor- 
tant part in regulating new technologies 
and research, from the atom bomb to work 
involving human embryonic stem cells. 
Some organizations and individual research- 
ers using CRISPR-Cas9 are already creating 
licensing agreements that reflect their own 
moral codes. In my view, government- 
driven efforts centred on national patent 
systems should be deployed to help regulate 
gene editing. 


NEW LAWS NEEDED 

Last year, the US National Academies of 
Science, Engineering, and Medicine recom- 
mended that clinical trials involving gene 
editing in human eggs, sperm or embryos 
should be permitted only for the treatment 
and prevention of serious disease or disabil- 
ity. They also urged that a “stringent over- 
sight” system be developed to limit the use 
of the technology in this context’. In July, 
the Nuffield Council on Bioethics, a highly 
respected bioethics body in the United King- 
dom, similarly stated that the use of heritable 
genome editing “could be ethically accept- 
able” only after appropriate governance 
measures are put in place’. 

These recommendations haven't yet 
translated into legal frameworks or formal 
governance structures. And the history of 
regulating emerging biotechnologies sug- 
gests that such laws could be a long time 
coming, if they end up being formed at all’. 
For now, when it comes to editing genes in 
humans and other organisms, the United 
States and the United Kingdom — along 
with many other countries — rely on laws 
and policies that cover existing genetic- 
engineering technologies. Or, as in the case 
of human germline editing in the United 
States, the government simply bans the use 
of federal funds for such research. 

Such policies have been criticized 
for decades as being inadequate*. Their 
insufficiencies are considerably more 
problematic in the context of gene editing, 
which, largely thanks to the development 
and uptake of CRISPR-Cas9 (see ‘Invention 
protection’), promises to have much greater 
societal impact than previous technologies 
for modifying genomes. 

In the United States, for instance, the 
oversight provided by the ‘coordinated 
framework (developed in the 1980s to deal 
with genetically engineered organisms) han- 
dles only immediate risks. The framework 
covers the management of altered plants 
and animals that have already been created, 
and does not consider the socio-economic, 
ecological or ethical consequences of creating 
organisms not found in nature. Likewise, 
since 2016, a condition in the budget for 
the US Food and Drug Administration has 
prohibited the agency from authorizing 


INVENTION PROTECTION 


The number of patents for inventions involving 
CRISPR-Cas9 has soared in just a few years. 
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Data for 2017 and 2018 are not yet complete; patent applications 
are generally published 18 months after filing at a patent office. 


clinical trials in which a “human embryo is 
intentionally created or modified to include 
aheritable genetic modification” But there is 
nothing to stop US researchers using private 
funds to edit the genes of human embryos 
in the lab. 


HISTORICAL PRECEDENTS 

How could patents help? These legal 
instruments — which give inventors the 
right to prevent others from commercial- 
izing their technologies — are usually seen 
solely as contracts that incentivize innova- 
tion. In fact, they can do much more, directly 
and indirectly. 

They can lead to higher prices for products, 
for instance, and reduce people’s access to 
important technologies if inventors use them 
to establish and 


maintain mono- “Patents 
polies. Perhaps canshape 
most importantly, innovation 


they can shape 
innovation trajec- 
tories. Patent laws were a major factor in the 
‘war of the currents’ in the 1880s, driving peo- 
ple to favour engineer George Westinghouse’s 
alternating current (AC) over the direct- 
current system invented by Thomas Edison. 
(Westinghouse licensed the US patents for 
AC from inventor Nikola Tesla.) The deci- 
sions that governments make about whether 
to grant patents implicitly demonstrate their 
moral approval of an invention and indicate 
what types of technology are likely to generate 
exclusive markets”. 

The idea that governments could use 
patent systems to shape both the develop- 
ment of a technology and its impact on 
society is not new. In the 1940s, the US 
Congress used the patent system to control 
the development and commercialization of 
atomic weaponry. 

To try to reduce the possibility 
of private actors developing atomic 
bombs, or of US intelligence leaking, 


trajectories.” 


Congress created a three-tier system of 
non-patentable, government patentable 
and privately patentable technologies in 
the Atomic Energy Act of 1954 (ref. 6). The 
US Patent and Trademark Office offered 
standard patents for technologies that fell 
into the ‘privately patentable’ category. But 
inventions that would be useful only in the 
production of fissionable material, or when 
using such material or atomic energy in 
a military weapon, were non-patentable. 
The government (specifically, the Atomic 
Energy Commission) could also step in and 
require ‘compulsory licenses’ for technolo- 
gies deemed to be in the public interest. 

Even further back, in the nineteenth cen- 
tury, the governments of several European 
countries, including France, Switzerland 
and Italy, limited or even banned patents 
on foods and pharmaceuticals to ensure 
that people had sufficient access to these 
products’. 


EXISTING FRAMEWORKS 

Biotechnology, including gene editing, is 
already regulated to some degree through 
patents. 

In 1998, the European Parliament and 
Council passed a directive on the legal pro- 
tection of biotechnological inventions. This 
harmonized Europe's approach to patents 
in the emerging field; it covers all Euro- 
pean Union countries and the 38 member 
countries of the European Patent Office. It 
also addressed people's concerns about the 
moral and socio-economic implications of 
individuals being able to obtain patents on 
living entities, such as human embryos or 
genetically engineered plants and animals. 

The directive states that governments 
can grant patents on animals that have been 
modified only if the resulting benefit to 
humankind outweighs the animal's suffer- 
ing. It even includes prohibitions on patent- 
ing processes that could be used to modify 
human sperm, eggs or embryos. 

Moreover, some scientific organizations 
and researchers who use CRISPR-Cas9 have 
themselves recognized the power of patents 
to govern gene editing, and are writing their 
own licensing agreements. 

For example, the Broad Institute of MIT 
and Harvard, in Cambridge, Massachusetts, 
is a non-profit research institution that 
holds expansive patents on CRISPR-Cas9 
technology. It prohibits its licensees from 
using CRISPR-Cas9 to modify human 
embryos, alter ecosystems or modify 
tobacco plants*. Similarly, Kevin Esvelt at 
the Massachusetts Institute of Technology 
(MIT), also in Cambridge, holds a patent on 
a ‘gene drive’ that could be used to spread a 
particular genomic alteration throughout an 
animal population. He requires those who 
wish to license this patent to disclose their 
proposed use, and has suggested that other 
scientists working on gene drives do the 
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Silvana Konermann (left) and Feng Zhang helped to develop CRISPR-Cas9 at hh Broad Institute in Massachusetts, which holds many CRISPR patents. 


same. He argues that this will enable public 
discussion’. 

What I'm calling for, however, is different: 
more-formal, comprehensive, government- 
driven regulation using the patent system. 

This would cover all domains of gene 
editing, not just certain areas of research. It 
would have more transparency and political 
legitimacy than individual efforts ever could, 
by involving government institutions that 
are explicitly charged with representing the 
public interest. And it would enable govern- 
ments to exploit the unique vantage point 
that patent offices have on the early stages 
of scientific fields and industries. (Inven- 
tors usually file patent applications before 
they try to get regulatory approval for new 
technologies.) 

In the United States, Congress could 
authorize a working group to convene an 
advisory committee for gene-editing pat- 
ents. The working group could include: indi- 
viduals from the Environmental Protection 
Agency, who are trained in assessing ecosys- 
tem impacts; staff from the Department of 
Commerce, which oversees the US Patent 
and Trademark Office; personnel from the 
Department of Health and Human Services, 
who have deep understanding of biomedi- 
cal research, health-care costs and research 
ethics; and staff from the Government 
Accountability Office, which in the past few 
years has developed expertise in technol- 
ogy assessment. The advisory committee 
should also comprise scientists, physicians, 
ethicists, social scientists, historians, lawyers 
and representatives from the private sector. 

Building on existing laws such as the 1954 


Atomic Energy Act, the committee could 
put together a regulatory framework for 
reviewing and awarding patents related to 
gene editing. It would need to incorporate 
the perspectives of citizens at every step”®, 
and might place inventions into distinct cat- 
egories. Perhaps the use of CRISPR-Cas9 for 
editing human embryos would not receive 
patent protection, for instance, whereas the 
use of the technology to correct a common 
mutation that causes heart failure would. 

Under such a framework, the committee 
could identify inventions that are likely to 
be so important to the public interest that 
the government should monitor closely how 
associated patents are used and licensed, and 
step in to force broad licensing if a patent 
holder charges too high a price for access to 
their invention. (Currently, the 1980 Bayh— 
Dole Act gives the US government ‘march-ir? 
rights in the case of taxpayer-funded research, 
although it has never been used in this way"’.) 

The EU directive on the legal protection 
of biotechnological inventions already pro- 
vides Europe with some guidance on which 
gene-editing processes and products to 
exclude from patentability’. But 20 years on, 
additional oversight is needed. To develop a 
more detailed governance framework, the 
European Patent Office should convene an 
advisory committee to develop a framework, 
similar to the one proposed for the United 
States. This could then by adopted by the 
European Patent Office and EU member 
countries. 

Ultimately, patent law will need to be 
just one of many regulatory schemes. 
Some developers might still create and use 
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ethically problematic technology, even if 
they are unable to patent it. But existing 
approaches, and the entities that are con- 
ventionally tasked with overseeing areas 
of scientific research, seem ill-equipped to 
address complex societal and value-based 
concerns in an increasingly privatized world. 
Patents, which affect the thousands of inves- 
tigators now using CRISPR-Cas9 in both the 
private and public sector, should be part of 
the mix. m 


Shobita Parthasarathy is professor and 
director of the Science, Technology, and 
Public Policy Program at the Gerald R. Ford 
School of Public Policy, University of 
Michigan, Ann Arbor, Michigan, USA. 
e-mail: shobita@umich.edu 
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To paint Ophelia (1851-52), John Millais spent months observing plants on the banks of the Hogsmill River in Surrey, UK. 


Rebels of art and science 


John Holmes explores the empirical underlay to Pre-Raphaelite masterworks. 


xtraordinary advances in science, 
) ees and industry shaped the 

Victorian age; alongside that grewa 
new experimentalism in literature and the 
arts. From 1848, the Pre-Raphaelite Brother- 
hood, a group of British artists founded by 
Dante Gabriel Rossetti, William Holman 
Hunt and John Everett Millais, began to 
weave science into their art. They sought a 
new aesthetic even as they called for art to 
model itself on science — and were cham- 
pioned by scientific luminaries from the 
comparative anatomist Richard Owen to 
physician Henry Acland. 

The Pre-Raphaelites rejected the 
insistence of the Royal Academy of Arts in 
London that artists should learn by imitating 
the paintings of Raphael. Modern interest in 
the group has grown steadily since a revival 
among the counter-culture of the 1960s. 
That is newly reflected in a retrospective 
on the work of Edward Burne-Jones (who, 
with Rossetti and William Morris, formed 
the ‘second-wave’ Pre-Raphaelite move- 
ment) at Tate Britain in London. Burne- 
Jones’s paintings, such as the 1880s Briar 
Rose series featured in the exhibition, seem 


Edward Burne-Jones 
Tate Britain, London. 
Until 24 February 2019. 


to open a window 
on an exquisitely 
romanticized fan- 
tasy world caught in 
moments of stillness. But John Ruskin, the 
era’s leading art critic and a serious amateur 
geologist and botanist, saw something else 
there. In 1884, he wrote that, although the 
brotherhood’s work might “seem to be the 
reaction of a desperate fancy ... against the 
incisive scepticism of recent science’, they 
were in fact “a part of that science itself” 
Ruskin supported the Pre-Raphaelites and 
knew them well. 

From the start, they insisted on working 
“from absolute data of fact” and acute 
observation, as the critic William Michael 
Rossetti — brother of Dante Gabriel and 
poet Christina Rossetti — explained in 
the magazine The Spectator in 1851. He 
noted that the group conducted “investi- 
gations” through art, and offered the pub- 
lic “unflinching avowal of the result”. A 
year previously, Pre-Raphaelite art critic 
Frederic George Stephens had spelt out 
the group’s commitment to these princi- 
ples in its short-lived periodical The Germ. 


490 | NATURE | VOL 562 | 25 OCTOBER 2018 


© 2018 Springer Nature Limited. All rights reserved. 


Stephens remarked that since the early 
1800s, disciplines such as chemistry had 
made astonishing progress “by bringing 
greater knowledge to bear upon a wider 
range of experiment’, and pursuing pre- 
cision. Why, he asked, shouldn't the same 
methods benefit the arts’ “moral purposes”? 

How did the group harness empirical 
methods to create its work? Take arguably 
the most famous Pre-Raphaelite painting, 
Millais’s Ophelia (1851-25). At first glance, 
this seems a sentimental portrayal of the 
tragic suicide of the character in William 
Shakespeare’s Hamlet. However, every plant 
depicted, from purple loosestrife to wild 
roses, is the product of more than three 
months of painstaking observation as Millais 
worked on the banks of the Hogsmill River 
in Surrey. Other artists had painted in the 
open air before, but never in such meticu- 
lously wrought detail. 

This became a collective experiment to 
discover what painting, pushed to its lim- 
its, could reveal. Each new work would 
press further, recording exact effects oflight 
and shade, as in Hunt’s 1851 The Hireling 
Shepherd; or ecological relationships and 


TATE IMAGES 
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animal behaviour, as in the straying sheep 
of his 1852 Our English Coasts; or skin tones 
in full sunlight, as in Ford Madox Brown's 
The Pretty Baa-Lambs (1851). Subjects 
were scrupulously researched. Hunt visited 
Jerusalem and the Dead Sea to study the 
landscape, people and latest archaeological 
findings for his paintings of the life of Jesus. 

Looking more closely at Ophelia, we 
see a study of physical and psychologi- 
cal phenomena. Millais asked his model, 
Elizabeth Siddall — poet, artist, and later 
Dante Gabriel Rossetti’s wife — to lie ina 
bath fully dressed. Candles were set under 
the bath for warmth. Millais’s concentra- 
tion was such in his epic eight-hour bout of 
painting that he failed to notice when they 
went out; Siddall caught a severe chill and 
he paid the doctor’s bill. Siddall thus paid a 
price for Millais’ ‘laboratory conditions, but 
the method did enable him to capture how 
hair and fabric float on and underneath the 
surface of water. 

Ophelia’s expression in the painting is also 
revealing. The Pre-Raphaelites were rightly 
scathing about the state of psychology in 
1850, when phrenology and physiognomy 
still passed as sciences. Stephens called it 
“dry operose quackery... mere chaff not 
studied from nature, and therefore worth- 
less, never felt, and therefore useless”. They 
set out to study the mind through art instead, 
refining their designs as they thought 
through the mental states of their subjects. 
An early drawing for Ophelia, now in the 
Plymouth City Museum, is melodramatic. 
In the finished painting, we see a much more 
subtle analysis: Ophelia subdued by despair, 
sinking into unconsciousness as she drowns. 
When Stephens argued that scientific meth- 
ods could advance art’s moral purpose, this 
is what he meant: it helps us to understand 
humanity and nature. 

The art establishment was crushing in its 


Ferns carved by James O’Shea top a column at the Oxford University Museum of Natural History. 


opposition to the group, as were mainstream 
journalists. Charles Dickens called Millais’s 
provocative 1849-50 painting Christ in the 
House of His Parents “odious, repulsive, 
and revolting” By contrast, many Victorian 
scientists supported the brotherhood. The 
naturalist William Broderip, who bought 
The Hireling Shepherd, introduced Hunt 
to Owen, founder of the Natural History 
Museum in London (and coiner of the word 
‘dinosaur’ ). Owen became a staunch advo- 
cate of the Pre-Raphaelites, and delighted 
in showing Millais and his children around 
the British Museums natural-history collec- 
tions. In 1881, Hunt painted the magnificent 
portrait of Owen now in the Natural History 
Museum. 

Acland, one of Owen's students and from 
1858 the Regius Professor of Medicine at the 
University of Oxford, was even more central 
to the movement. When he and Ruskin cam- 
paigned in the 1850s for a natural-history 
museum in Oxford, Acland declared that it 
would be decorated on Pre-Raphaelite prin- 
ciples. The stone- 
work was based 


“Many Victorian 

ecionisis on real plants and 
animals, carved in 

supported the 


minute detail by 
unsung heroes of 
Victorian sculpture 
John and James O’Shea. The stone columns 
around the central court were geological 
samples drawn from quarries around Brit- 
ain to illustrate different periods of Earth’s 
history. The court was surrounded by statues 
of scientists at work, from Galileo Galilei to 
James Watt. Rossetti advised on the project; 
Ruskin and Siddall, among others, contrib- 
uted designs. Thomas Woolner, the broth- 
erhood’s only sculptor, with the group’s 
close collaborators John Lucas Tupper and 
Alexander Munro, fashioned the sculptures. 
The Oxford University Museum of Natural 


brotherhood.” 
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History stands as one of the best and most 
surprising collections of Victorian public 
sculpture, and the only one dedicated to 
science. 

Woolner went on to collaborate with the 
architect Alfred Waterhouse. When Water- 
house built the commanding Natural His- 
tory Museum in London for Owen in the 
1870s, they again applied Pre-Raphaelite 
principles. Owen supplied Waterhouse 
with specimens from the vivarium and 
illustrations of extinct animals as models 
for the terracotta menagerie that adorns the 
museums facade. Waterhouse paid tribute 
to Owen by including ancient fauna he had 
described, such as the archaeopteryx and the 
palaeotherium. 

It was also Woolner who made the Pre- 
Raphaelites’ most direct contribution to 
science. When carving a bust of Charles 
Darwin (now in the herbarium at the Uni- 
versity of Cambridge), Woolner alerted the 
biologist to a small protuberance some- 
times visible inside the rim of the human 
ear, known as the auricular tubercle. In the 
1840s, when devising a statuette of Puck 
from Shakespeare’s A Midsummer Night’s 
Dream, Woolner had observed that the fea- 
ture also appears in monkeys, as pointed 
ears. Both Darwin and Woolner recog- 
nized this physiological phenomenon as 
evidence of human evolution from earlier 
primates, and Darwin — dubbing it the 
‘Woolnerian tip’ — mentioned it in The 
Descent of Man (1871). 

Woolner finished the bust in 1869. The 
same year, the astronomer Norman Lockyer 
founded a new periodical — Nature. Lockyer 
had Pre-Raphaelite connections, too: he had 
worked with William Rossetti on an earlier 
journal, The Reader; befriended Hunt; and 
employed Pre-Raphaelite landscape painter 
John Brett to accompany an expedition to 
Sicily to study the solar eclipse in 1870. In 
1878, Lockyer wrote a series of articles on 
‘Physical Science for Artists, setting out 
guidance in optics and assessing paintings 
from the latest Royal Academy exhibition 
on grounds of scientific accuracy. In Nature, 
Lockyer held artists to exacting scientific 
standards, just as Stephens had done in The 
Germ. 

The Pre-Raphaelites launched the most 
radical and ultimately the most influen- 
tial Victorian art movement, inspiring the 
European symbolists and the Arts and Crafts 
movement led by Burne-Jones’s great asso- 
ciate, William Morris. They also took their 
lead from — and shaped the culture of — 
Victorian science, and affected its legacy to 
this day. m 


John Holmes is professor of Victorian 
literature and culture at the University of 
Birmingham, UK. His latest book is The 
Pre-Raphaelites and Science. 

e-mail: j.holmes.1@bham.ac.uk 
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Ben Barres: gender champion 


Marc Freeman lauds the neurobiologist’s posthumously published memoir. 


n unstoppable force of nature, 
Anteee by headwinds, manag- 

ing to will all of us onwards and 
upwards: this was Ben Barres. A highly 
influential neurobiologist and advocate 
for women in science, Barres lived an 
unusually interesting life. He was an openly 
transgender faculty member at Stanford 
University School of Medicine in Califor- 
nia, and a pioneer in understanding the 
functions of glia — the most abundant and 
mysterious cells in the brain. Whether by 
design or accident, along the way he also 
became a hero for people from gender and 
sexual minorities (LGBT+ people), and for 
early-career scientists generally. 

In 2017, Barres died of cancer at the age 
of 63. His posthumously published memoir, 
The Autobiography of a Transgender Scien- 
tist, documents his remarkable life story. 

Born Barbara Barres in mid-1950s New 
Jersey, Barres was a precocious, science- 
loving child who relentlessly pursued 
academic opportunities. But from an 
early age, he never felt comfortable being 
treated as female. As he writes: “internally 
I felt strongly that I was a boy. This was 
evident in everything about my behavior” 
Although educated and trained at a time 
when sexism was rampant in schools and 
academia, his talent, self-confidence and 
drive propelled him swiftly upwards. He 
entered top-notch medical and science 
training programmes, from undergradu- 
ate study at the Massachusetts Institute 
of Technology in Cambridge to a medical 
degree at Dartmouth College in Hanover, 
New Hampshire, a doctorate at Harvard 
University in Cambridge and a post- 
doctoral position at University College 
London (see A. D. Huberman 553, 282; 
2018). 

Barres admits that during his academic 
career as Barbara, especially early on, he did 
not even consider the idea that gender would 
be used to assess anyone’s qualifications or 
limit their opportunities. When a profes- 
sor accused Barres of enlisting a boyfriend 
to complete a difficult problem (because he 
assumed a woman couldn't have done it), 
Barres was incensed — but mainly about the 
accusation of cheating. After he transitioned 
in 1997 and began to live as a man, he came 
to understand this and other episodes as 
personal experiences of pervasive sexism. 
An advocate was born. 

Barres’ promotion of women in science 
is well known, and he recounts some of his 
efforts in the book. In an essay in Nature, 


Neuroscientist Ben Barres died last year. 


‘Does gender matter?’ (B. Barres Nature 
442, 133-136; 2006), he helped to debunk 
the idea of intrinsic gender differences in 
scientific ability. Barres worked relentlessly 
to improve the representation of women 
in all areas of science: as faculty members 
and conference or departmental seminar 
presenters, and in leadership positions. To 
support that aim, he volunteered for count- 
less selection committees, editorial boards 
and grant-reviewing panels, and spoke up. 
He would be combative if needed — and 
that was fun to watch. The effect he had was 
impressive and inspiring. 

More recently, Barres became a vocal 
advocate for trainees. He describes in glow- 
ing terms how grateful he was to his own 
mentors — David Corey and Linda Chun 
at Harvard, and Martin Raff at University 
College London 
— and the power- 
ful impact of their 
approach to science 
on his professional 
growth. He describes 
his joy in mentoring 
young scientists, and 
the thrill of watching 
them succeed; he saw 
mentoring as a sacred 


The Autobiography ; : 

ofa Transgender duty. His reflections 
Scientist on the importance of 
BEN BARRES picking a good men- 
MIT Press (2018) tor and how we should 
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treat postdoctoral fellows should be required 
reading for all academics. An appendix lists 
his trainees, an impressive group including 
many leaders in their fields. These people 
were Barres’ family. 

Barres was best known for his work 
on glial cells, and he describes his lab’s 
major scientific contributions, including 
his surprising discovery that these cells 
release factors that help make synaptic 
connections between neurons. Just as in 
conversation, Barres’ enthusiasm for the 
science and the people doing it leaps out, 
and the details of many key discover- 
ies come fast and furiously. Barres was a 
leader in boosting the status of glia from 
boring support cells for neurons to essen- 
tial nervous-system cells that interact with 
neurons dynamically to help modulate 
their growth, connectivity, function, plas- 
ticity and health. His lab helped to launch 
a now-thriving field. 

It is touching to read Barres’ account of 
the emotional pain he experienced because 
of his gender discordance. He grappled with 
this issue at a time when — just as some still 
do today — certain sectors of society strug- 
gled to talk about homosexuality, never 
mind issues of gender identity. Before he 
transitioned, he writes, there were moments 
when he considered killing himself, a situ- 
ation that is alarmingly common among 
people with gender dysphoria. He felt he 
was faced with the choice of his career or his 
personal happiness. 

Science is hard enough to begin with; I 
cant imagine also having these sorts of life 
pressures. Fortunately, he had wonderfully 
supportive colleagues and friends across 
the scientific community. Because of his 
own past pain, Barres felt he owed it to 
the LGBT+ community to discuss his life 
openly and emphasize how transitioning 
had brought him relief and happiness. 

Barres is as open in memoir as he was 
in life, and the book teaches important, 
deceptively simple lessons. Be yourself; be 
happy; don’t apologize for who you are. Be 
respectful, but be honest and express your 
opinion even (or especially) if it’s not popu- 
lar. Science is exhilarating, and we have a 
responsibility to do it well, with fairness 
to all involved. The Autobiography of a 
Transgender Scientist shows the way. = 


Marc Freeman is a neuroscientist and 
director of the Vollum Institute at Oregon 
Health and Science University in Portland. 
e-mail: freemmar@ohsu.edu 
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Global responsibility 
for publishing costs 


As readers, many scientists in 
Europe will welcome the news 
that most work will have to be 
published in open-access journals 
from 2020 (M. Schiltz Front. 
Neurosci. 12, 656 (2018); see 
also Nature 561, 17-18; 2018). 
But as knowledge producers, I 
fear that many more scientists 
around the globe are likely to be 
disenfranchised by richer nations, 
institutions and funding bodies. 
Open-access publication 
requires authors to pay in the 
region of US$1,000-3,000 (more 
than the cost of many research 
projects in some disciplines). 
Although scientists from low- 
income countries are eligible for 
full-fee waivers, compulsory open 
access will force many others to 
use money intended for research, 
or to publish in low-tier journals 
that still retain reader paywalls. 
In my view, sources of all 
publication fees should be 
recorded — just as funding 
sources are now — so that 
marginalized researchers can 
be identified and rates of waiver 
use tracked. The findings would 
guide realistic fee capping 
by European open-access 
publications. 
John Measey Stellenbosch 
University, Stellenbosch, South 
Africa. 
john@measey.com 


Plan S debate is 
not “a pity” 


Robert-Jan Smits declares it 
a “pity” that arguments about 
academic freedom are stifling 
debate on his ‘Plan S; which 
promotes a radical shift towards 
open-access publishing (see 
Nature 562, 174; 2018). In fact, 
the opposite is happening. 
Spirited debates on the topic 
are ongoing among researchers, 
publishers, librarians, journalists, 
funders and members of 
the public (see, for example, 
go.nature.com/2qtusrb; go.nature. 
com/2coxgrx; go.nature. 
com/2nm2dmgq; go.nature. 


com/2ckhnre; go.nature. 
com/2qw2hv6). We have yet to 
reach agreement on what to make 
of the major European funders’ 
radical shift to compulsory 
open-access publishing by 2020, 
but we continue to explore this 
important issue in good faith. 
Ina Plan S world, the research 
community will need to address 
academic responsibility, the 
future of scholarly societies 
and their journals, and how to 
respect disciplinary differences 
and ensure the high quality of 
publications. We invite Smits 
and all other architects of the 
plan to engage academics in 
constructive discourse on these 
issues. 
J. Britt Holbrook New Jersey 
Institute of Technology, Newark, 
New Jersey, USA. 
Stephen Curry Imperial College, 
London, UK. 
Shina C. L. Kamerlin Uppsala 
University, Uppsala, Sweden 
holbrook@njit.edu 


Bullying: report 
without reprisal 


I appreciate efforts by 
institutions to counteract 
academic bullying (see Nature 
560, 420 (2018); Nature 560, 
529; 2018). They should also 

set up a clear, fair and accessible 
reporting system, with no fear of 
reprisal for the institution or the 
people who have been abused. 

Bullying issues are arguably 
worse for international 
students and scholars than 
for domestic lab members. 
International researchers 
are already disadvantaged 
by visa requirements and 
financial constraints, and 
such abuse exacerbates their 
insecurities over position and 
job prospects — particularly if it 
takes the form of infringement 
of intellectual property and 
unfair authorship positioning on 
publications. 

An efficient reporting system 
for victims would also benefit 
their institutions and funding 
organizations by helping them 
to select a new generation of 
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more nurturing leaders. The 
media could also play a part by 
making it clear in their reporting 
that individual cases are not an 
indictment on an institution's 
overall reputation. This might 
also speed up the handling of 
abuse cases. 

Morteza Mahmoudi Harvard 
Medical School, Boston, 
Massachusetts, USA. 
mmahmoudi@bwh.harvard.edu 


Co-producers: move 
into charity sector 


The James Lind Alliance has a 
14-year track record of involving 
patients, carers and clinicians in 
determining priorities for health 
research (see www, jla.nihr. 
ac.uk). Charity Futures, another 
co-produced research initiative, 
is using the James Lind Alliance's 
consultation process for the first 
time outside medicine. 

We ask charities and donors 
about the research topics that they 
consider the most important (see 
go.nature.com/2pwsre4). Our 
aim is to encourage more research 
into those areas and so enable 
charities and donors to base their 
work on better evidence (see 
also C. Fiennes Nature 546, 187; 
2018). We shall report publicly on 
our findings next year. 

Stephen Bubb Charity Futures, 
London, UK 

Katherine Cowan Brighton, UK 
Caroline Fiennes Giving 
Evidence, London, UK 
caroline.fiennes@giving-evidence. 
com 

Competing interests declared 

(see go.nature.com/2ph6tz3 for 
details). 


Co-producers: frame 
only the questions 


I workat the interface of science 
and public policy, so appreciate 
the importance of public values 
in prioritizing research problems 
(Nature 562, 7; 2018). The 
challenge is to make this happen 
without disrupting the evidence 
base that enables effective 
delivery of solutions. 


There is a general principle here 
that traces back to philosopher 
David Hume's famous distinction 
between ‘is’ and ‘ought; ‘fact’ and 
‘values’ (A Treatise of Human 
Nature, 1739). Ina democracy, 
it makes sense that the way we 
frame what is studied and how 
we respond to results is subject 
to dialogue that affords more 
importance to those affected 
than to those doing the research. 
However, values should frame the 
questions, not the answers. 

This is less clear-cut in 
the social world (where 
co-production has its roots) than 
in the natural world, because in 
the social world observed and 
observer are not so distinct from 
one another. 

The answers need to be 
grounded in the scientific 
treatment of carefully collected 
evidence. 

Peter Calow University of 
Minnesota, Minneapolis, USA. 
pcalow@umn.edu 


Let writers author 
scientific literature 


Footnotes listing individual 
author contributions to research 
papers help to offset ambiguities 
in formal authorship, but are 
easily overlooked. Until due 
credit can be fairly allocated by 
artificial-intelligence algorithms 
(see G. L. Kiser Nature 561, 

435; 2018), I propose confining 
authorship to those who wrote 
the paper. People assessing credit 
for all other functions would then 
be forced to consult the detailed 
contributions list. 

This ‘authorship for authors’ 
scheme would promote scientists’ 
writing skills. A journal article is 
a short story: it needs creativity, 
clarity, structure and pace. Yet 
scientists receive little training in 
narrative and tend to write poorly. 
This impairs progress because 
readers are forced to struggle with 
tedious or confusing text. Let's 
help the scientific literature live 
up to its name. 

Tobias I. Baskin University of 
Massachusetts, Amherst, USA. 
baskin@umass.edu 


NEWS & VIEWS 


SOLAR PHYSICS 


A window onto the Sun’s core 


An experiment has measured the energy spectrum of solar neutrinos associated with 99% of the nuclear reactions that 
power the Sun. The results provide a glimpse into the depths of the solar core. SEE ARTICLE P.505 


ALDO SERENELLI 


nergy is generated in the 
Hires of the Sun through 

sequences of nuclear reac- 
tions in which four protons fuse 
together to form a helium-4 
nucleus. These sequences are 
accompanied by the release of two 
particles known as electron neu- 
trinos. Models suggest that 99% of 
the nuclear energy released by the 
Sun originates from three reaction 
sequences — collectively known as 
the proton-proton (pp) chain — 
that are initiated by the fusion of 
two protons. On page 505, the 
Borexino Collaboration’ reports 
the first complete measurement of 
neutrino fluxes that originate from 
these three sequences, based on an 
analysis of more than 2,000 days of 
data collection. The results help us 
to understand the details of how 
and why the Sun shines. 

Neutrinos interact weakly 
with matter, and therefore escape 
almost unhindered from the Sun’s 
interior, to reach Earth about eight 
minutes later. Solar neutrinos 
therefore provide a direct view 
into the nuclear furnace in the 
Sun's core. The Borexino experi- 
ment (Fig. 1) detects such neu- 
trinos and determines how much 
energy they have by measuring the 
amount of light produced when the 
particles interact with the detect- 
ing agent (an organic liquid, called 
the scintillator, which is kept underground to 
minimize the amount of background radiation 
that can interfere with the neutrino signals). 
In contrast to all other solar-neutrino experi- 
ments, Borexino can measure the energies of 
both high- and low-energy neutrinos, which 
makes it possible to study the structure of the 
solar core using a technique known as neutrino 
spectroscopy. 

Electron neutrinos can change into two 
other types (or flavours) of neutrino, known 
as tau and muon neutrinos, as they travel to 
Earth, a phenomenon known as flavour oscil- 
lation. The Borexino experiment is more 


Figure 1 | The Borexino experiment, Gran Sasso, Italy. A researcher 
stands in a spherical vessel that forms part of the Borexino neutrino detector. 
The Borexino Collaboration’ has used the detector to produce the first 
simultaneous measurement of the neutrino fluxes associated with the nuclear 
reactions that account for 99% of the Sun’s energy. 


sensitive to electron neutrinos than to tau or 
muon neutrinos, and so flavour oscillation 
needs to be accounted for when the measured 
neutrino fluxes are used to calculate the fluxes 
produced in the Sun. Taking this into consid- 
eration, the Borexino collaborators used the 
measured neutrino flux to work out the total 
power generated by nuclear reactions in the 
Sun’s core, with an uncertainty of about 10%, 
and found that this is the same as the measured 
photon output — thus showing that nuclear 
fusion is indeed the source of energy in the 
Sun. This value, calculated for the amount of 
energy produced through nuclear reactions, 
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is comparable with previous” 


results obtained by combining data 
from several neutrino-detection 
experiments, and places the most 
robust and model-independent 
constraints on the source of solar 
energy. 

The findings also have inter- 
esting ramifications for neutrino 
physics. By combining their data 
with predictions from standard 
solar models, the collaborators 
determine a quantity known as 
the electron neutrino survival 
probability (which describes the 
probability that an electron neu- 
trino created in the Sun will also 
be detected as an electron neu- 
trino at the detector) for neutri- 
nos produced in four reactions 
of the pp chain. The calculated 
survival probabilities include the 
best available value for low-energy 
neutrinos, which correspond to 
an energy regime in which flavour 
oscillation is expected to occur 
mostly in vacuum conditions. 
Combined with the survival prob- 
abilities determined for higher- 
energy neutrinos, the findings 
give strong support to our current 
understanding” of neutrino oscil- 
lations — that is, the idea that low- 
energy neutrinos change flavour as 
they propagate through a vacuum, 
and that the oscillations of high- 
energy neutrinos are enhanced by 
their interactions with electrons. 

The new results also shed light 
on a long-standing paradox in solar phys- 
ics, which arises because the chemical com- 
position of the Sun is not well established. 
The most-recent complete spectroscopic 
determinations of the Sun’s metallicity’ (the 
abundance of all solar elements heavier than 
helium) yielded a value that is 35% lower than 
older spectroscopic results’. Intriguingly, when 
numerical models of the solar interior are con- 
structed using the lower value of metallicity 
as a constraint, the simulated properties are at 
odds with our knowledge of the Sun’s interior 
structure (which is well characterized by helio- 
seismological studies’ that analyse oscillations 


VOLKER STEGER/SPL 


produced by waves that propagate through the 
Sun’s interior). But when the older (higher) 
metallicity values are used, the simulations 
reproduce solar properties very well. This is 
known as the solar abundance problem, and 
calls into question the validity of the present 
models of stellar evolution, or of spectroscopic 
methods for determining the Sun’s composi- 
tion, or both. 

However, the relative contributions of the 
three different reaction sequences in the pp- 
chain, determined from the Borexino experi- 
ment, can be used to infer the temperature in 
the solar core —a region that is poorly mapped 
by helioseismological studies. The Borexino 
findings hint at a core temperature that is 
consistent with predictions from models that 
assume high solar metallicity. That said, the 
results are not yet precise enough to provide a 
definite answer to the solar abundance prob- 
lem, because neutrino fluxes predicted by both 
the high- and low-metallicity solar models are 
compatible with the new results. 

Nevertheless, the Borexino experiment 
might provide a definite answer in the future. 
About 1% of the Sun’s nuclear energy is pro- 
duced through chains of nuclear reactions 
known as CNO cycles*. These cycles are 
catalysed by the presence of carbon, nitrogen 
and oxygen, and so their efficiency depends 
linearly on solar metallicity. If the neutrino 
fluxes associated with CNO cycles could be 
measured, then the abundances of these ele- 
ments in the solar core could be determined. 

Such measurements have proved difficult 
at Borexino so far, because of background 
radiation produced by the radioactive decay 
of bismuth-210 (which forms from the decay 
of uranium-238, an isotope present in tiny 
quantities in all matter in the Solar System). 
Modifications to the vessel that holds the liquid 
scintillator have now been made’ that should 
address this issue. The detection of CNO neu- 
trinos would not only allow the Sun's metallicity 
to be determined, but would also provide direct 
evidence that CNO cycles occur in nature. This 
is important, because CNO cycles are thought 
to be the main mechanism by which stars more 
massive than the Sun generate energy’. 

Another major issue in astrophysics is the 
proposed existence of non-standard mecha- 
nisms for the production or loss of energy in 
stars’”. If such a mechanism exists, there will 
be an imbalance between the solar production 
rate of nuclear energy and luminosity (the total 
amount of energy radiated as photons from 
the Sun's surface). The precision with which 
the power generated by nuclear reactions in 
the Sun can be measured would need to be 
increased tenfold to 1% to allow tests of such 
non-standard particle physics. Such precision 
may be out of reach for Borexino, but it might 
be possible in future large-scale neutrino and 
dark-matter detectors. = 


Aldo Serenelli is in the Astrophysics and 
Planetary Sciences Department, Institute 


of Space Sciences (CSIC), and at the 
Institut d’Estudis Espacials de Catalunya, 
Bellaterra 08193, Spain. 
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How to lose your 


inheritance 


In developing embryos, molecular and physical differences divide the cells that 
will form eggs or sperm and those that will form the body. The mouse protein 
OTX2 directs this decision by blocking reproductive-cell fate. 


DIANA J. LAIRD 


cells destined to contribute to the next gen- 

eration of an organism are set aside early 
in embryonic development. It is unclear why 
primordial germ cells (PGCs), the precursors 
of eggs and sperm, are established so early in 
the development of many multicellular organ- 
isms. This process of establishing the germ line 
involves both preventing a non-reproductive- 
cell (somatic-cell) fate and activating a cellular 
state known as pluripotency — the ability to 
give rise to the many different cell types in the 
body. Understanding how the germ line forms 
is a key requirement for the goal of generat- 
ing healthy eggs and sperm in vitro for fertility 
treatment in the clinic. Work in this area has 
focused mainly on identifying proteins that 
specify germline fate, but comparatively little is 
known about why somatic cells do not acquire 
such a fate. Writing in Nature, Zhang et al.' 
reveal that the formation of PGCs in mice can 
be blocked by a protein called OT X2. 

In many animals, specification of the germ 
line operates like a hereditary monarchy 
in which, like the direct transmission of the 
crown jewels from one generation to the next, 
the passage of molecular components in the 
cytoplasm down the generations determines 
the cells that will form PGCs. However, some 
animals, including salamanders, crickets, 
mice and possibly humans, take a different 
approach. In the early mouse embryo, designa- 
tion of the germ line occurs as a result of cells 
being in the right place at the right time, rather 
than inheriting the species’ crown jewels. In 
this inductive fate-determination scenario, 
cells of a cylindrically shaped region of pluri- 
potent cells known as the epiblast are coaxed 
into adopting a PGC fate by signals from 


IE an ultimate act of family planning, the 


supporting extraembryonic tissues adjacent 
to the embryo. 

In this mode of germline formation, the 
instructive cues that travel between cells 
include proteins of the Wnt and BMP families”. 
PGCs normally form at a predictable location 
in the epiblast (Fig. 1). However, in vivo graft- 
ing experiments in mouse embryos revealed 
that cells from elsewhere in the epiblast have 
the capacity to become PGCs if they are trans- 
planted to that location’. The search for the 
components that drive germline fate in epiblast 
cells in response to the ‘kingmaker’ BMP pro- 
teins identified the transcription-factor pro- 
teins BLIMP1, PRDM14 and AP2y (refs 4, 5). 
This trio of proteins not only drives the expres- 
sion of genes required to make the germ line, 
but also blocks the expression of genes associ- 
ated with a somatic-cell fate*”. 

The process of PGC development can be 
recapitulated in vitro, starting from mouse 
embryonic stem (ES) cells that can be induced 
to form epiblast-like cells. If these epiblast- 
like cells are exposed to BMPs and certain 
other factors, then as many as 13.5% of the 
cells form PGCs°. However, if such epiblast- 
like cells are engineered to express BLIMP1, 
PRDM14 and AP2y, more than 30% form 
PGCs without the requirement* for BMPs. 
Yet knowing that a particular pathway can 
drive the formation of PGCs doesn’t answer 
the questions of whether the default pathway 
of cellular differentiation in embryonic devel- 
opment is to form germline or somatic cells, 
or whether all of the cells in the epiblast are 
equally capable of becoming germline cells. 
Both matters have implications for our under- 
standing of the evolution of multicellularity, 
as well as for our ability to generate healthy 
eggs or sperm from stem cells for clinical 
applications. 
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Figure 1 | Expression of the transcription factor OT X2 declines in cells that will form primordial 
germ cells. Zhang et al.' report how primordial germ cells (PGCs), the precursors of egg and sperm 

cells, form in vivo in mice from cells in an embryonic region called the epiblast. The formation of PGCs 
requires a signal from BMP proteins in the adjacent extraembryonic region’. The authors report that 
there is a fall in OT X2 expression in the epiblast region where PGCs will subsequently arise. The drop 

in OTX2 expression in these cells is followed by a rise in the expression of other transcription factors 

— BLIMP1, PRDM14 and AP2y — that are involved the formation of PGCs**, which form in an area 
adjacent to a layer of embryonic cells called the mesoderm. The authors’ results indicate that OTX2 blocks 


the establishment of PGCs. 


Zhang and colleagues studied OTX2, a 
transcription factor known to be involved 
in the development of the nervous system. 
In vitro studies had revealed’ that OTX2 pro- 
motes the transition of mouse ES cells into 
epiblast-like cells. Zhang et al. scrutinized the 
expression of OT X2 in mouse stem cells that 
were differentiating into germline cells in vitro. 
They found that, in epiblast-like cells progress- 
ing towards PGC formation, OTX2 levels 
decline 12-24 hours before BLIMP1, PRDM14 
and AP2y are upregulated. The authors’ in vivo 
analysis in mice produced similar results 
(Fig. 1). Considered together, these patterns 
of expression make sense: the level of OTX2 
is low in PGCs and ES cells, both of which 
exist in a state of pluripotency termed naive 
pluripotency, which is defined as the ability 
to give rise to all types of cell in the body. By 
contrast, the level of OT X2 is high in the epi- 
blast and in ES-cell-derived epiblast-like cells, 
which are considered to be more restricted in 
the number of cell lineages they can form. 

The timing of the OT X2 decline before the 
transition from epiblast-cell to PGC fate raises 
the question of whether OTX2 might prevent 
cells from transitioning into PGCs. To investi- 
gate this, Zhang et al. studied the formation of 
PGCs in the absence of OTX2. Compared with 
the situation for wild-type cells, elevated num- 
bers of PGCs were found if embryos lacked 
the Otx2 gene, or if epiblast-like cells grown 
in vitro were Otx2-deficient. Conversely, if 
epiblast-like cells were engineered to express 
higher than usual levels of OT X2, the forma- 
tion of PGCs was prevented. 

Moreover, the authors found that the 


acquisition of PGC fate in the absence of Otx2 
could be accomplished in vitro without the 
usual requirements of BLIMP1 and signalling 
molecules called cytokines. This was unex- 
pected because in vivo and in vitro experi- 
ments*” have indicated that PGC production 
requires BLIMP1. How is the requirement for 
BLIMP] in germline-cell formation bypassed? 
Given that OTX2 promotes a form of pluri- 
potency in the epiblast that is more restricted 
than that of naive pluripotency’, perhaps the 
primary function of BLIMP1 is to create a 
naive state of pluripotency for newly forming 
PGCs. 

The authors’ results are consistent with a 
model in which BMPs, possibly acting through 
Wnts, cause the level of OT X2 to decline. This 
reduction in OT X2 is one of the earliest known 
steps that determine whether an epiblast cell 
will acquire a germ-cell or somatic-cell fate. In 
an intriguing parallel to the system in fruit flies 
and worms, in which transcription is repressed 
altogether and cytoplasmic ‘crown jewels’ are 
inherited”, the first commitment to a PGC fate 
in mice is now revealed to be a process in which 
cells are defined by what they are not. Extin- 
guishing the transcription factor OT X2 pre- 
vents it from driving gene expression associated 
with the more-restricted state of pluripotency of 
the epiblast’, instead making way for the naive 
pluripotency needed for PGCs. 

The authors show that the absence of OTX2 
drives PGC formation, but it remains to be 
determined whether these PGCs differenti- 
ate normally and form fully functional eggs 
and sperm. Might the levels of OT X2 affect 
the fitness of germline cells? For example, 
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would germline cells that have low levels of 
OTX2 be less likely to successfully contribute 
to reproduction than cells in which OTX2 has 
been completely extinguished? Conversely, 
does the level of OT X2 affect the fitness of 
somatic cells? 

Many other questions remain to be 
answered. How do BMPs and Wnts cause a 
decline in OTX2 levels? What is the regula- 
tory relationship between OT X2 and other 
transcription factors that act in developing 
PGC? In the development of the mouse eye”, 
OTX2 drives the expression of BLIMP1 ina 
process linked to a switch between two types 
of cell fate, yet in the PGCs, BLIMP1 seems to 
function after OT X2 levels have declined. 

In the developing ear of the fruit fly Dros- 
ophila melanogaster”, the transcription of 
OTX2 is directly regulated by the transcription 
factor N-myc. The known roles of Myc-family 
genes in metabolism and in a process known 
as cell competition” suggest that even minor 
variations in OTX2 levels in the epiblast might 
reflect a biologically meaningful variation in 
the fitness of cells that will go on to form the 
somatic-cell lineages or the germ line. In other 
words, ifit turns out that the levels of Myc cor- 
relate with the health of an epiblast cell and its 
ability to outcompete its neighbouring cells, 
and if Myc controls the level of OT X2 in mice, 
then OTX2 might have a role in the fitness of 
epiblasts, and perhaps in that of PGCs. 

OT X2 could provide an avenue for inves- 
tigating the cell-fate decision between form- 
ing germline and somatic cells, and might 
offer ways of improving PGC differentiation 
in in vitro approaches. If this mechanism of 
OTX2-mediated regulation of PGC fate in 
mice is evolutionarily conserved, then perhaps 
similar progress might be made in such studies 
of human cells. m 
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Parrot patrol 


Understanding the movements of animals 
can help to focus conservation efforts on 
key regions of habitat. A standard approach 
to tracking small animals is to tag them 
with a tiny radio transmitter and monitor 
the emitted signals using a hand-held 
device. But this can prove challenging when 
following highly mobile animals across 
difficult terrain. 

Writing in Science Robotics, Cliff et al. 
report their analysis of the use of 
autonomous aerial vehicles to track 
tagged birds (O. M. Cliff et al. Sci. Robot. 3, 
eaat8409; 2018). They monitored wild swift 
parrots (Lathamus discolor, pictured), an 
endangered Australian species. The authors 
report that these drones can estimate 
birds’ positions as rapidly as can humans 
experienced at using the standard manual- 
tracking method. Mary Abraham 


OPTICAL PHYSICS 


Precise control of 
infrared polarization 


A natural material has been discovered that exhibits an extreme optical property 
known as in-plane hyperbolicity. The finding could lead to infrared optical 
components that are much smaller than those now available. SEE LETTER P.557 


THOMAS G. FOLLAND & 
JOSHUA D. CALDWELL 


yperbolic materials are highly 
H reflective to light along a certain axis 

and reflective to light along a perpen- 
dicular axis. Typically, one of these axes is in 
the plane of the material and the other is out 
of the plane. A material in which both of these 
axes are in the plane would enable, for exam- 
ple, the manufacture of ultrathin waveplates — 
optical components that alter the polarization 
of incident light. Moreover, the reflective 
behaviour of this material would allow light 
to be confined and manipulated at extremely 
small scales (less than one-hundredth the 
wavelength of the light). On page 557, Ma et 
al.' report the existence of such in-plane hyper- 
bolicity in the natural material molybdenum 
trioxide. 

Many crystals exhibit birefringence, in 
which their refractive index — a measure of 
the speed of light in a material — is different 
along different axes. This property can be used 
to manipulate the polarization of incident 


light. The crystal size that is required to achieve 
sufficient polarization control for practical 
applications is directly proportional to the 
wavelength of the incident light and to the 
strength of the birefringence. Consequently, 
for light in the mid- to far-infrared regions 
of the electromagnetic spectrum (with wave- 
lengths of 3-300 micrometres), the crystals 
typically need to be a few millimetres thick’. 
To overcome this requirement, a potential 
solution is to consider materials that exhibit 
hyperbolicity, which is an extreme form of 
birefringence. 

Hyperbolicity was originally thought to exist 
only in artificial materials consisting of inte- 
grated reflective and transparent domains’. But 
in 2014, it was observed in the natural mate- 
rial hexagonal boron nitride*”. The reflective 
behaviour of both this material and molybde- 
num trioxide is derived from crystal-lattice 
vibrations, known as optical phonons, that 
oscillate in a highly anisotropic (direction- 
dependent) way. These phonons have rela- 
tively long lifetimes (in excess of a picosecond; 
1 psis 10°’ s), which strongly suppresses the 
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absorption of light by the material®. Since the 
discovery of hyperbolicity in hexagonal boron 
nitride, a broad array of natural hyperbolic 
materials has been identified’. 

Preliminary investigations of molybdenum 
trioxide were reported earlier this year® and 
showed the existence of hyperbolicity for long- 
wave infrared light (with wavelengths of 8-14 
um). Ma and colleagues have now demon- 
strated and characterized in-plane hyperbolic- 
ity for the same spectral range. They used this 
property to confine light to dimensions sub- 
stantially smaller than its wavelength, through 
the formation of hybrid light-matter excita- 
tions called hyperbolic phonon polaritons. The 
authors report lifetimes for such polaritons of 
up to 20 ps, which is about ten times longer 
than the best values reported for hexagonal 
boron nitride’. 

Because the crystal structure of molybdenum 
trioxide is highly anisotropic, all three crystal 
axes, which define the edges of the crystal’s unit 
cell, have different lengths. Consequently, there 
isa large difference in the phonon energies asso- 
ciated with these axes and therefore in the cor- 
responding refractive indices — resulting in a 
birefringence of about 0.31. It should be noted 
that, earlier this year, a similarly large in-plane 
birefringence of 0.76 was reported in the natu- 
ral material barium titanium sulfide for mid- 
infrared to long-wave infrared light'®. However, 
hyperbolicity was not observed for this material. 

The in-plane hyperbolicity of molybdenum 
trioxide offers opportunities to replace con- 
ventional optical components with ones that 
are much smaller. In particular, using the large 
in-plane birefringence of this material (or of 
barium titanium sulfide), infrared waveplates 
could be constructed from thin slabs that have 
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50 Years Ago 


Clearance by Carp 

Britain spends about £2.5 million 

a year on removing water weeds 
from inland waterways, and in 1964 
the British River Authorities spent 
on average £69 per mile on weed 
clearance. In an attempt to cut these 
costs the Ministry of Agriculture, 
since 1964, has been experimenting 
with the grass carp, a fish native to 
China, but widely cultivated... asa 
means of biological control of water 
weed. Although the experiments 
are still at a very early stage, no 
major snags have occurred so far 
and the ministry's Salmon and 
Fresh Water Fisheries Laboratory 

is still optimistic that the method 
may work. 

From Nature 26 October 1968 


100 Years Ago 


There is a general belief that it 

is a relatively easy problem to 
estimate a person's intelligence 

by looking at him; and teachers, 
physicians, and employers are often 
compelled to make judgments as 

to the intelligence of a given person 
with no more data than can be 
obtained from a rapid survey of his 
appearance ... In the Psychological 
Review... Mr. R. Pinter gives 

the results of an investigation he 
made for the purpose of testing 

the trustworthiness of these 
judgments. The author chose 
twelve photographs of children 
varying in intelligence from proved 
feeble-mindedness to unusually 
great ability and asked groups of 
people to arrange the photographs 
in order of merit for intelligence. 
His groups consisted of physicians, 
psychologists, teachers, and 
miscellaneous people. He found 
that... on the judgment of no one 
group or of no one person could any 
reliance be placed ... The author 
concludes that... these haphazard 
judgments are too untrustworthy to 
be of practical value. 

From Nature 24 October 1918 


Conventional 
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Polarization 
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Figure 1 | Manipulating infrared polarization. Ma et al.' show that the material molybdenum trioxide 
can be used to precisely control the polarization of infrared light. a, Optical components known as 
waveplates can convert linearly polarized light into circularly polarized light. In the infrared, a waveplate 
made of a conventional material requires a thickness in excess of 1 millimetre. This material could be 
replaced with a thin slab of molybdenum trioxide, with a thickness on the order of tens of micrometres. 

b, Components called polarizers can convert unpolarized light (in which the polarization points in all 
directions) into linearly polarized light. In the infrared, polarizers made from conventional materials 
typically need to be thick and use a large grid of metal wires. Such a structure could be replaced with a thin 
film of molybdenum trioxide that requires essentially no fabrication. c, Nanoscale photonic structures 
made from conventional materials can emit unpolarized infrared light. But if molybdenum trioxide were 


used, linearly polarized emission could be achieved. 


thicknesses on the order of tens of micrometres 
(Fig. 1a). Such waveplates could operate in the 
long-wave infrared, for which commercial 
waveplates are not widely available and have 
thicknesses in excess of 1 mm. 

Furthermore, using the material’s in-plane 
hyperbolicity, polarizers — components that 
extinguish undesired polarizations of incident 
light — could be made from simple 1-m-thick 
films (Fig. 1b). Previously, polarizers needed to 
be thicker and typically required a large grid of 
metal wires to be patterned on their surface. The 
remarkable properties of molybdenum trioxide 
could therefore greatly reduce both the size and 
the cost of optical components, offering broad 
applicability in thin, compact infrared devices. 

Beyond conventional optics, the properties 
of molybdenum trioxide could lead to advances 
in the realm of nanophotonics, which focuses 
on confining light to nanoscale dimensions. In 
the long-wave infrared, where the hyperbolic- 
ity of this material is observed, nanoscale light 
confinement necessarily implies defeating the 
diffraction limit — the usual restriction that 
light cannot be squeezed into dimensions much 
smaller than its wavelength. Molybdenum tri- 
oxide can beat this limit and, as a result, presents 
opportunities for producing improved infrared- 
emitting devices. 

For instance, heating nanoscale photonic 
structures made from materials that can sup- 
port polaritons can produce light of one or 
more specific frequencies — rather than light 
of a broad range of frequencies that that emit- 
ted by, for example, conventional light bulbs. 
Such structures provide an optical source that 
is akin to light-emitting diodes, but that can be 
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designed to operate anywhere in the infrared. 
The emitted light from these photonic struc- 
tures is usually unpolarized (Fig. 1c). Itis only 
through the use of materials that exhibit in- 
plane hyperbolicity that light ofa single, pure 
polarization can be generated. 

Finally, hyperbolic materials such as molyb- 
denum trioxide could serve as the basis for 
hyperlenses — lenses that produce magnified 
images of objects smaller than the wavelength 
of the imaging light. They could also be used 
in heterostructures (structures in which layers 
of different materials are combined) to make 
nanophotonic components that have control- 
lable properties”. 

Ma and colleagues have demonstrated that, 
once again, nature has more in store for us than 
we thought. The future of nanophotonics was 
once considered to be in the realization of arti- 
ficial materials, but this study and others in the 
past few years have demonstrated that, in many 
cases, the best approach for finding advanced 
materials is to look among the vast array of nat- 
ural materials. The results of these studies offer 
substantial advances in the fields of infrared 
optics and nanophotonics that could enable 
infrared imaging and detection to become as 
ubiquitous as its visible counterpart — a vision 
that would enable imaging through smoke for 
first responders, near-instant medical diagnos- 
tics and enhanced chemical spectroscopy. = 
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Transforming the global 


food system 


Can the predicted rise in global food demand by 2050 be met sustainably? A 
modelling study suggests that a combination of interventions will be needed to 
tackle the associated environmental challenges. SEE ARTICLE P. 519 


GUNTHER FISCHER 


he global population in 2010 was 
| estimated to be 6.9 billion people, and 
by 2050 is predicted to reach between 
8.5 billion and 10 billion people’. This increase 
would bring a corresponding rise in food 
demand that would affect the environmental 
toll that food production exerts on the planet. 
On page 519, Springmann et al.’ report their 
analysis of the environmental pressures that 
would arise in a projected scenario for the 
global food system in 2050. They also mod- 
elled the effects of implementing approaches 
to lessen the environmental consequences of 
food production. 

Food security has long been a challenge 
for human societies, and is a pressing global 
issue. Indeed, many targets related to this area 
are part of the United Nations’ Sustainable 
Development Goals’, which include eradicat- 
ing hunger, ending poverty and combating 
climate change. Achieving a sustainable global 
food system clearly requires progress on social, 
economic and environmental fronts. 

Springmann and colleagues built a model to 
assess the projected global demand for agricul- 
tural products by 2050 on a country-by-country 
basis, given the expected changes in population, 
income levels and dietary preferences by that 
time. It has been predicted*that global income 
in 2050 will be 3-4 times higher than it was in 
2010. The authors’ projections of future food 
consumption were based on established statis- 
tical associations between food demands and 
changes in income or population. These predict 
that, by 2050, there will be less undernutrition, 
a shift towards greater global consumption of 
livestock-based products and a fairly constant 
intake of staple crops per person. 

The authors assessed predicted global 
environmental impacts for the projected food 
production by mid-century. They focused on 
five environmental pressures: the greenhouse- 
gas emissions associated with agricultural pro- 
duction; the use of land for crop production, 


given the associated consequences (such as 
carbon or biodiversity losses) that might 
accompany land-use changes; the demand for 
water to irrigate crops; and the application of 
either nitrogen- or phosphorus-based fertiliz- 
ers, respectively. It is important to consider 
fertilizers because of the greenhouse-gas 
emissions that are linked to their use, and the 
possibility that they might contaminate soils 
or aquatic ecosystems. 

Springmann et al. compared the projected 
environmental impacts in 2050 to a proposed 
set of planetary boundaries thought to repre- 
sent safe operating limits for human activities’. 
For example, the boundary set by the authors 
for agricultural greenhouse-gas emissions was 
established in relation to the threshold neces- 
sary to keep global warming at a level of 2°C 
above pre-industrial levels. However, their 
limit for emission levels is less stringent than 
the limit needed to achieve the 1.5 °C target set 
in the United Nations Framework Convention 


2 
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Figure 1 | Discarded food waste in British Columbia, Canada. This food did not reach consumers. 
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on Climate Change Paris Agreement of 2015, 
which was analysed in a recent report® by the 
Intergovernmental Panel on Climate Change. 
This report details how limiting warming to 
1.5°C rather than to 2°C above pre-industrial 
levels would reduce the climate-related risks 
to health, livelihoods, food security and water 
supply. On the basis of current food yields 
and agricultural practices, Springmann and 
colleagues estimate that, between 2010 and 
2050, the environmental impacts of the food 
system could increase by between 50% and 
92% and reach levels that exceed the proposed 
boundaries’ for planetary stability. 

The researchers went on to assess the effect 
of possible interventions that could reduce 
these environmental pressures. These meas- 
ures relate to managing food demand and 
raising food-production efficiency in terms of 
three broad intervention categories. 

One intervention category concerns 
improvements in agricultural technologies 
and resource management. These could 
enhance production efficiency and increase 
crop yields per unit of land, given a particular 
water and nutrient input. Another category 
was dietary changes, whereby individuals 
might limit their meat consumption and 
move towards plant-based foods. Meat pro- 
duction usually requires a more intensive and 
environmentally damaging mode of produc- 
tion than that needed for plant-based food’. 
Moreover, limiting meat and sugar consump- 
tion and eating fruit and vegetables is aligned 
with nutrition guidelines for a healthy diet’. 
The third category the authors considered was 
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reduction of food-chain waste from field to 
plate. It is estimated that up to one-third of 
food doesn't reach the market (Fig. 1) or is dis- 
carded after purchase’. Reducing this waste 
would increase food availability without the 
need for extra food production. 

Springmann and colleagues conclude that 
an intervention in only one of the three cat- 
egories they analysed would not achieve 
planetary sustainability across all five of the 
environmental domains that they assessed. 
Instead, a bundle of interventions in all three 
categories would be needed to ensure that 
the global food system could be sustainably 
supported by the planet in 2050. They found 
that the projected greenhouse-gas emissions 
from agriculture would not be supportable 
unless global meat consumption was reduced. 
They also report that the expansion of crop- 
land and water use would be best counteracted 
by improvements in agricultural technolo- 
gies and management approaches that bring 
farming yields closer to the maximum yield 
efficiency that is ecologically possible. In addi- 
tion, their analysis indicates that achieving 
fertilizer-use reduction would require a com- 
bination of measures that improve farming 
practices and decrease food demand. 

There are some caveats regarding Spring- 
mann and colleagues’ scenarios. For example, 
they did not take climate-change effects into 
account in their projections of future agricul- 
tural production, and such impacts should be 
a priority for future analysis. Also, the authors’ 
analysis did not consider the world’s grassland 
areas, even though they represent more than 
double the area of global cropland’*. These 
grassland areas should be considered when 
setting planetary boundaries for land use. 
Moreover, Springmann and colleagues’ study 
analyses only the environmental impacts of 
cropland-based food production — it doesn’t 
assess how to balance these impacts with those 
in sectors such as energy, transport or industry. 

Nevertheless, the authors’ analysis is 
valuable and informative for the discussion 
about how to achieve a sustainable food system 
that meets future needs, even if some of the 
planetary-boundary values they used have 
large uncertainty ranges"’. In addition, any 
proposed interventions should not be imple- 
mented using a one-size-fits-all approach. 
Instead, any regulatory frameworks and incen- 
tives will need to be tailored to the needs of a 
given region, whether this means investments 
in education, health-service access, land-use 
regulations or water allocation, for example. 

Springmann and colleagues also did not 
address certain key issues that are needed to 
develop a resilient agricultural system. The 
rights of access to land and natural resources, 
and the long-term security of those rights, is 
needed to motivate investments by farmers. 
Farmers could also be helped by improvements 
in transport, finance and communication infra- 
structure that enable them to access advanced. 
technologies, minimize their production risks 


and target their production for local or inter- 
national markets. 

A recent report” by the Food and Agri- 
culture Organization of the United Nations 
concludes that environmental sustainability 
and food security can go hand in hand by 2050, 
but that substantial investments are needed to 
transform the global food system. Political and 
public commitment will be essential to ensure 
increases in budgets for the development of 
international agriculture. 

Food demand and food production are 
two sides of the global food-system equation. 
Springmann and colleagues’ work provides 
a timely warning that interventions will be 
needed in both domains to achieve food 
security in the future, and to ensure that the 
environmental impacts of the food-production 
system remain within boundaries that Earth 
can sustain. m 
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fake graphene 


The material graphene has a vast number of potential applications — but a survey 
of commercially available graphene samples reveals that research could be 
undermined by the poor quality of the available material. 


PETER BOGGILD 


raphite is composed of layers of carbon 

atoms just a single atom in thickness, 

known as graphene sheets, to which 
it owes many of its remarkable properties. 
When the thickness of graphite flakes is 
reduced to just a few graphene layers, some 
of the material's technologically most impor- 
tant characteristics are greatly enhanced — 
such as the total surface area per gram, and 
the mechanical flexibility of the individual 
flakes. In other words, graphene is more than 
just thin graphite. Unfortunately, it seems that 
many graphene producers either do not know 
or do not care about this. Writing in Advanced 
Materials, Kauling et al.' report a systematic 
study of graphene from 60 producers, and find 
that many highly priced graphene products 
consist mostly of graphite powder. 

Imagine a world in which antibiotics could 
be sold by anybody, and were not subject to 
quality standards and regulations. Many 
people would be afraid to use them because 
of the potential side effects, or because they 
had no faith that they would work, with 
potentially fatal consequences. For emerg- 
ing nanomaterials such as graphene, a lack of 
standards is creating a situation that, although 


502 | NATURE | VOL 562 | 25 OCTOBER 2018 


© 2018 Springer Nature Limited. All rights reserved. 


not deadly, is similarly unacceptable. 

One of the most well-established methods 
for producing graphene for commercial appli- 
cations is liquid-phase exfoliation® (LPE) — a 
process that involves milling graphite into 
a powder, and separating the particles into 
tiny flakes by applying mechanical forces ina 
liquid. Those precious flakes that contain just a 
few layers of graphene are then separated from 
the rest (Fig. 1). Graphene produced in this 
way has a huge number of potential applica- 
tions, including battery technology, composite 
materials and solar cells. The LPE of graphite 
was first achieved using sonication to produce 
the flakes’, and later work showed that even a 
kitchen blender* can be used to create violent 
turbulent forces that pull graphene sheets apart 
without destroying them. 

But how thin must graphite flakes be to 
behave as graphene? A common idea, backed 
up” by the International Organization for 
Standardization (ISO), is that flakes contain- 
ing more than ten graphene layers are basically 
graphite. This seemingly arbitrary threshold 
has some basis in physics, as Kauling et al. 
note. For example, thermodynamic consid- 
erations dictate that each layer of atoms in a 
flake of ten or fewer layers behaves as an indi- 
vidual graphene crystal at room temperature. 
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Figure 1 | Liquid-phase exfoliation of graphene. Most commercially available bulk graphene is made by 
milling graphite into powder, and then subjecting the resulting particles to mechanical forces in a liquid 
solution to separate the powder into flakes, for example, by using sonication; flakes not shown to scale. 
The flakes are then sorted according to their size and thickness. Kauling et al.' analysed commercially 
available graphene from 60 providers, and found that the majority of the samples contained less than 

10% of graphene (flakes that contain fewer than ten layers of carbon atoms’). The rest is essentially just 


graphite powder. (Adapted from ref. 1.) 


Moreover, the rigidity of flakes scales with the 
cube of layer thickness, which means that thin 
graphene flakes are orders of magnitude more 
flexible than thicker graphite flakes. 

So size really matters: depending on the 
practical application, graphene and graphite 
powders can give entirely different results. 
Without clear standards by which to deter- 
mine the quality of commercially available 
graphene, companies and researchers risk 
wasting time and money doing research on 
graphite powder disguised as expensive, high- 
grade graphene. This would stunt the devel- 
opment of graphene technology, harming 
serious graphene producers and application 
developers alike. 

But are these concerns truly warranted? 
Ina study aimed at answering this question, 
Kauling et al. established a systematic test pro- 
tocol based on an arsenal of well-established 
methods for characterizing graphene, and 
then used the protocol to benchmark 60 gra- 
phene products from different producers, a 
daunting task. The results showed that the sta- 
tistical distributions of the key material indi- 
cators — such as the size, structural integrity 
and purity of the graphene — varied greatly. 
Shockingly, the study revealed that less than 
10% of the material in most of the products 
consisted of graphene composed of ten or 
fewer layers. None of the products tested con- 
tained more than 50% of such graphene, and 
many were heavily contaminated, most likely 
with chemicals used in the production process. 

It seems that the high-profile scientific 
discoveries, technical breakthroughs and 
heavy investment in graphene have created 
a Wild West for business opportunists: the 
study shows that some producers are label- 
ling black powders that mostly contain cheap 
graphite as graphene, and selling them for top 
dollar. The problem is exacerbated because 
the entry barrier to becoming a graphene pro- 
vider is exceptionally low — anyone can buy 
bulk graphite, grind it to powder and make a 
website to sell it on. 

Unless common standards and test 


protocols are introduced, there is a great risk 
of dropping the ball at the worst possible time. 
Dozens of emerging applications for graphene 
are closely linked to some of society’s grand 
challenges: health, climate, renewable energy 
and sustainability. Some of these applications 
might never leave the starting block if the early 
development is based on ‘fake graphene’. 
Kauling and colleagues’ article is therefore 
a much-needed wake-up call for graphene 
producers, buyers and researchers to agree on 
and to adhere to sound standards: a transpar- 
ent graphene market would benefit everyone, 
except perhaps unscrupulous vendors. The 
first steps towards this have already been 
taken with the ISO’s graphene vocabulary” 
(a document that defines standard termi- 
nology for describing graphene) and the UK 
National Physical Laboratory’s helpful Good 
Practice Guide for graphene characterization®, 
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Nowit’s time to push on. 

It should be noted that Kauling and co- 
workers’ study does not cover all the types 
of bulk graphene on the market’. Moreover, 
although the authors analysed an impres- 
sive number of LPE-manufactured products, 
they could have eliminated any accusations of 
potential bias by specifying the criteria they 
used to select the products for analysis. It is 
also possible that they unintentionally missed 
high-quality graphene sold by a few excellent 
producers. And, as the researchers mention, 
different applications generally make use 
of different characteristics of graphene — 
which makes it difficult to come up with a 
universal metric of quality. 

Nevertheless, the work is a timely and ambi- 
tious example of the rigorous mindset needed 
to make rapid progress, not just in graphene 
research, but in work on any nanomaterial 
entering the market. To put it bluntly, there can 
be no quality without quality control. m 
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Senescence mediates 
neurodegeneration 


Aggregation of the protein tau is implicated in neurodegenerative diseases in 
humans. It emerges that eliminating a type of damaged cell that no longer divides 
can prevent tau- mediated neurodegeneration in mice. SEE LETTER P.578 


JAY PENNEY & LI-HUEI TSAI 


r There is strong interest in understanding 
how neurodegeneration is affected by a 
cellular state called senescence, in which 

cells stop dividing, suppress intrinsic cell- 

death pathways and release pro-inflammatory 
molecules that can harm healthy neighbours’. 

On page 578, Bussian et al.’ examine the role 

of senescent cells in a mouse model ofa type of 

neurodegeneration that involves aggregation 


of the protein tau. They find that neuronal 
expression of mutant tau triggers senescence 
in glia, the support cells of the brain. Prevent- 
ing the build-up of senescent glia can block 
the cognitive decline and neurodegeneration 
normally experienced by these mice. 
Senescent cells are characterized by various 
molecular and gene-expression changes, 
including elevated levels of the cell-cycle 
inhibitor protein p16". Senescence can be 
identified by a test that stains cells blue if they 
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Figure 1 | Cell crosstalk in neurodegeneration. Mice that have been genetically engineered so that 
their neurons produce a mutant form of the protein tau (tauP301S) model some neurodegenerative 
diseases of humans. a, Bussian et al.’ provide evidence that tauP301S expression leads neurons in these 
mice to release unknown signals that induce a cellular state called senescence in neighbouring cells. As 


a result, genes such as p16'\*“ 


that are associated with senescence are activated in cells called astrocytes 


and microglia, which can proliferate. b, When senescent, astrocytes and microglia have elevated levels 
of p16'“** protein and stop proliferating. They release a group of molecules known as senescence- 
associated secretory phenotype (SASP) factors that, possibly in combination with other regulators of 
tau, signal back to neurons. This leads to the phosphorylation of tau and its aggregation into structures 
called neurofibrillary tangles (NFTs) — two hallmarks of neurodegeneration. 


harbour senescence-associated B-galactosidase 
(SA-B-Gal) — a form of the B-Gal enzyme that 
is active at pH 6 (in healthy cells, B-Gal is inac- 
tive at this pH)’. The cells also secrete inflam- 
matory signalling molecules, growth factors 
and protease enzymes that can impair the 
function, and ultimately the survival, of non- 
senescent cells in their vicinity’. This trait is 
known as the senescence-associated secretory 
phenotype (SASP). 

The gradual build-up of senescent cells 
contributes to ageing in multicellular organ- 
isms’*. Furthermore, senescence can be 
induced by various cellular insults. Senescent 
neurons or glia have been described in people 
with brain injury or neurodegenerative 
disorders such as Parkinson's and Alzheimer’s 
diseases'**®. Strategies have been developed 
that selectively target and eliminate senescent 
cells, counteracting many of the effects of ageing 
and age-related disorders in animal models””*. 
But despite intense study, the exact effect of 
senescent cells in different contexts — includ- 
ing in neurodegeneration — remains unclear. 

Bussian et al. set out to examine the role 
of senescence in neurodegeneration. They 
focused on the aggregation-prone neuronal 
protein tau, which is associated with multiple 
forms of neurodegeneration. For instance, 
a mutation in tau that changes amino-acid 
residue 301 from proline to serine (dubbed 
tauP301S) causes frontotemporal dementia’. 
And, when phosphorylated at abnormally 
high levels, tau forms structures called neuro- 
fibrillary tangles (NFTs) that are a hallmark of 
Alzheimer’s disease’. 

The authors made use of mice that have 
been engineered to express human tauP301S 
in neurons, and so model human tau-mediated 


neurodegenerative diseases. They found 
elevated levels of various senescence-associated 
genes, including p16", in the brains of 
tauP301S-expressing mice compared with 
control animals. Using electron microscopy, 
the researchers examined which types of brain 
cell stained for SA-B-Gal in tauP301S mice. 
They observed no staining in neurons, but 
SA-B-Gal was detected in the two main types 
of glia — astrocytes and microglia. The group 
complemented their electron microscopy with 
an examination of senescence-associated gene 
expression in isolated brain-cell types. This, too, 
provided evidence of senescence in astrocytes 
and microglia, but not in neurons. 
Importantly, Bussian and colleagues found 
that senescence-associated gene expression 
in tauP301S mice increased with age, but pre- 
ceded NFT deposition and neurodegeneration. 
This suggests that the emergence of senescent 
cells could affect the latter two traits. To exam- 
ine this possibility, the researchers eliminated 
senescent cells in the animals as they arose, by 
using a genetic tool that causes expression of 
a cell-death-promoting enzyme specifically 
in cells that produce p16“. Removal of 
senescent cells prevented brain shrinkage and 
thinning of a cognition-related brain region 
called the dentate gyrus — two characteristics 
of tau-mediated neurodegeneration typically 
seen in tauP301S animals. Furthermore, cog- 
nitive function was maintained in tauP301S 
mice lacking senescent cells, whereas 
tauP301S animals in which senescent cells were 
retained exhibited short-term memory defects. 
Perhaps more surprisingly, given that it 
indicates complex crosstalk between neu- 
rons and senescent glia, genetically elimi- 
nating senescent astrocytes and microglia 
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reduced neuronal tau phosphorylation and 
NEFT deposition. Moreover, the authors found 
similar effects when they treated tauP301S 
mice with a ‘senolytic’ compound, which trig- 
gers pharmacological removal of senescent 
cells. Together, Bussian and colleagues’ data 
clearly demonstrate that tauP301S expression 
in neurons can induce senescence in brain 
astrocytes and microglia. In turn, these senes- 
cent glia affect the ability of neurons to regulate 
tau phosphorylation and aggregation (Fig. 1). 
Whether by releasing signalling molecules 
that directly affect tau or through the effects 
of SASP factors (or both), glial senescence 
ultimately promotes neuronal degeneration. 

Bussian and co-workers’ findings point to 
several avenues for future study. First, the sig- 
nals from tauP301S-expressing neurons that 
induce senescence in glia should be defined. 
Similarly, the mechanisms by which senes- 
cent astrocytes and microglia signal back to 
neurons remain to be determined. It will also 
be interesting to understand whether the same 
glia-derived signals affect both tau pathology 
and neuronal survival, and whether astrocytes 
and microglia send the same or distinct signals. 
The answers to these questions are likely to 
have broader implications for understanding 
neurodegenerative diseases more generally. 

Finally, the current study adds to the 
growing body of evidence indicating that 
senolytic treatments could benefit people 
who have a wide range of conditions'”. Of 
immediate interest is whether removal of 
senescent cells can decrease disease sever- 
ity in other animal models of neurodegen- 
eration. The authors removed senescent cells 
throughout the lives of their animals, but it 
will also be valuable to determine whether 
senolytics can have beneficial effects if treat- 
ment is started once a disease has progressed 
to symptomatic stages — a more likely sce- 
nario in humans. Finally, it will be crucial to 
determine whether the processes uncovered 
in this paper are evolutionarily conserved in 
humans. If so, perhaps senolytic treatments 
can benefit people, as promised by this and 
other mouse studies’. m 
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About 99 per cent of solar energy is produced through sequences of nuclear reactions that convert hydrogen into helium, 
starting from the fusion of two protons (the pp chain). The neutrinos emitted by five of these reactions represent a unique 
probe of the Sun’s internal working and, at the same time, offer an intense natural neutrino beam for fundamental physics. 
Here we report a complete study of the pp chain. We measure the neutrino-electron elastic-scattering rates for neutrinos 
produced by four reactions of the chain: the initial proton-proton fusion, the electron-capture decay of beryllium-7, 
the three-body proton-electron-proton (pep) fusion, here measured with the highest precision so far achieved, and 
the boron-8 beta decay, measured with the lowest energy threshold. We also set a limit on the neutrino flux produced 
by the *He-proton fusion (hep). These measurements provide a direct determination of the relative intensity of the two 
primary terminations of the pp chain (pp-I and pp-Il) and an indication that the temperature profile in the Sun is more 
compatible with solar models that assume high surface metallicity. We also determine the survival probability of solar 
electron neutrinos at different energies, thus probing simultaneously and with high precision the neutrino flavour- 
conversion paradigm, both in vacuum and in matter-dominated regimes. 


In 1937, Gamov and von Weizsacker!” suggested that the Sun is powered 
by a chain of nuclear reactions initiated by proton-proton fusion and 
leading to the production of “He. This idea was further developed by 
Bethe and Critchfield’. At about the same time von Weizsacker and 
independently Bethe proposed an alternative mechanism, namely the 
carbon-nitrogen—oxygen cycle (CNO cycle)’, a closed-loop chain of 
nuclear reactions catalysed by ’2C, ‘“N and '°O nuclei in which four 
protons are converted into “He. Although the CNO cycle was incor- 
rectly considered to be the main source of energy in the Sun (mainly 
because of the overestimation of the Sun’s central temperature available 
at that time), the debate on the role of the CNO cycle in the Sun is still 
relevant today. Indeed, a direct measure of its importance is missing, 
although theory predicts that it cannot contribute more than about 1% 
of the solar luminosity. Conversely, it is now understood to be the main 
source of energy in stars heavier than the Sun. More historical details 
can be found in ref. >. 

The Sun and lower-mass stars are predominantly powered by the 
pp chain (see Fig. 1), which has been thoroughly studied by Fowler 
and co-workers in the 1950s°. He and A. Cameron also pointed out 
that the detection of solar neutrinos could be a direct way of testing 
theoretical solar models. Their intuition was correct and neutrinos are 
now considered to be the sole direct probes of the Sun’s core and of 
solar energy generation. 

Neutrinos are copiously emitted in the primary pp fusion reaction of 
the chain and, to a minor extent, in the alternative three-body pep pro- 
cess and in the two secondary branches pp-II (7Be neutrinos) and pp-III 
(8B neutrinos). Experimentally, solar neutrinos have been studied since 
the late 1960s by radiochemical experiments (Homestake’, SAGE® and 
GALLEX?’) which, however, could only provide a measurement of their 
integrated flux on Earth above a threshold. Prior to the establishment of 
the Borexino project, only °B neutrinos (<0.01% of the total flux) have 
been measured individually by KamiokaNDE/SuperKamiokande!” and 
the Sudbury Neutrino Observatory (SNO)!!. Their measurements have 
definitively proved that neutrinos undergo leptonic flavour (that is, 
electronic, muonic or tauonic) conversion in the Sun’s matter, enhanced 


through the MSW (Mikheyey-Smirnov- Wolfenstein) mechanism!*~"4. 
For an historical review of solar-neutrino astronomy and of its impact 
on solar and neutrino physics see, for example, refs Noe 1d, 

The measurement of all neutrino components is the most direct way 
to test the standard solar model (SSM) and to validate our theoretical 
understanding of the properties of the Sun's core. The first theoretical 
predictions of neutrino fluxes were made in the 1960s by J. Bahcall 
and his collaborators and have subsequently been refined by many 
theoretical groups'®. Despite the results delivered by solar-neutrino 
experiments, important questions about the Sun remain unanswered. 
For example, the solar metallicity, that is, the abundance of elements 
heavier than He, is poorly understood, even though it is a fundamental 
parameter for the determination of the physical properties of the Sun. 
A precise measurement of the solar neutrino fluxes comprising the pp 
chain and the CNO cycle would directly settle the controversy between 
high-metallicity (HZ) and low-metallicity (LZ) SSMs!8 (see Methods). 
This study is a step in this direction. 

Solar neutrinos are also powerful probes of neutrino properties. First, 
they allow the determination of oscillation parameters, especially the 012 
mixing angle and, to a lesser degree, the Am, mass splitting. Second, 
the measurement of the electron neutrino survival probability (P,.) as a 
function of neutrino energy allows us to probe directly the MSW-LMA 
mechanism of neutrino oscillations’? and to search for deviations that 
could indicate the presence of physics beyond the standard model. 

Running continuously since 2007, the Borexino experiment has 
measured, one after another, Be neutrinos?°-”’, pep neutrinos”*, °B 
neutrinos” and pp neutrinos”. Here we report the simultaneous pre- 
cision spectroscopic measurement of the complete pp chain and its 
implications for both solar and neutrino physics. 


Borexino and the solar-neutrino analysis 

Borexino is a liquid-scintillator experiment at the Laboratori Nazionali 
del Gran Sasso in Italy*®. Given the tiny cross-section of neutrino 
interactions with electrons (o ~ 107** cm? to 107* cm? for the 
solar-neutrino energy range), the rates expected in Borexino are small, 


*A list of participants and their affiliations appears at the end of the paper. 
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Fig. 1 | Nuclear fusion sequences and neutrino energy spectrum. 
Schematic view of the pp and CNO nuclear fusion sequences. The solar- 
neutrino energy spectrum is obtained from http://www.sns.ias.edu/~jnb/, 
using the updated fluxes taken from ref. !8. The flux (vertical scale) is given 
in units of cm~* s~' MeV“! for continuum sources and in cm~? s~! for 
monoenergetic sources. 


ranging from less than one to a few tens of counts per day per 100 tons 
(t) for different solar-neutrino components. To cope with such a low 
event rate, Borexino has a large target mass (about 300 t) and is housed 
deep underground, under 3,800 m water equivalent of dolomitic rock 
that suppresses the flux of cosmic radiation by a factor of approximately 
one million. For more details on the detector, see Methods. 

Radioactive decays of unstable isotopes contained in the scintillator 
or in the materials surrounding it represent the main sources of back- 
ground (referred to as internal and external, respectively). Whereas 
external background is greatly reduced by concentric layers of high- 
purity materials surrounding the scintillator and by the selection ofa 
centrally located software-defined fiducial volume, most of the inter- 
nal background can only be cut down by means of liquid-scintillator 
purification. Particularly, interactions of beta particles (3; electrons 
and positrons) and of gamma particles (7; high-energy photons) must 
be reduced to very low levels, since they cannot be distinguished from 
neutrino interactions on an event-by-event basis. Borexino has reached 
unprecedented levels of scintillator radio-purity. As an example, one 
gram of liquid scintillator contains less than 9.4 x 10~7° grams of 
uranium-238 and less than 5.7 x 107! grams of thorium-232 (95% 
confidence level, C.L.), a concentration about ten orders of magni- 
tude smaller than in any natural material on Earth. This low level of 
background has enabled real-time detection of solar neutrinos with an 
energy threshold of 0.19 MeV, and allowed us to perform the complete 
spectroscopy of the pp chain. 

Solar neutrinos reach the Earth as a mixture of all neutrino flavours 
(electronic, muonic, and tauonic) owing to the flavour-conversion mech- 
anism enhanced by the MSW effect (see Methods). Borexino detects 
them by means of their weak elastic scattering off electrons. A fraction 
of the incoming neutrino energy E,, is transferred to one electron, which 
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deposits it in the liquid scintillator. The scintillator light is detected by 
about 2,000 photomultiplier tubes, which ensure high detection effi- 
ciency of photoelectrons produced by incident optical photons at their 
photocathodes. For Be (E, = 0.384 MeV and 0.862 MeV) and pep 
(E, = 1.44 MeV) neutrinos, the induced electron recoil endpoints are 
0.230 MeV, 0.665 MeV and 1.22 MeV, respectively. For the continuous pp 
and 8B spectra, they are 0.261 MeV and 15.2 MeV, respectively. 

The detected light and its time distribution among photomultiplier 
tubes yield three important quantities for each interaction event in 
the detector: its deposited energy, roughly proportional to the total 
number of detected photoelectrons; its position within the detector, 
obtained from the analysis of the photon arrival times at each photo- 
multiplier tube; and its particle identification, based on a pulse-shape 
discrimination method that exploits the different time structure of 
liquid-scintillator light pulses produced by different particles (elec- 
trons, positrons, o particles and protons)”’. For reference, a 1-MeV 
electron produces on average 500 photoelectrons in 2,000 photomul- 
tiplier tubes, its energy is measured with a = 50 keV and its position is 
reconstructed”®? with o = 12 cm. 

We divided the analysis into two energy regions that are affected 
by different backgrounds, which need to be handled differently: a 
low-energy region (LER) of 0.19-2.93 MeV, to measure the pp, “Be 
and pep neutrino interaction rates, and a high-energy region (HER) of 
3.2-16 MeV, to measure 8B neutrinos. For the same reason, the HER is 
further divided into two subregions, below and above 5.7 MeV (HER-I 
and HER-II). The measurement of *B neutrinos cannot be extended 
below 3.2 MeV because of the 2.614-MeV ~-ray background from 7°°T] 
decays, originating from trace ?**Th contamination of the thin nylon 
liquid-scintillator containment vessel. 

The reconstructed position of each event within the detector allows 
us to define a fiducial volume optimized differently for the analysis in 
the LER and HER-I/II. The LER fiducial volume is chosen to suppress 
external -rays from 40K, 2/4Bi and 7°°Tl contained in materials sur- 
rounding the scintillator and consists of the innermost 71.3 t of scintil- 
lator selected with a radial cut (radius R < 2.8 m) and acut in the vertical 
direction (—1.8 m < z < 2.2 m). The HER is above the energy of the 
aforementioned +-rays. The analysis in HER-I requires only az < 2.5m 
cut to suppress background events related to a small pinhole in the inner 
vessel that causes liquid scintillator to leak into the region outside the 
inner vessel. The total selected mass in this case is 227.8 t. In contrast, 
the analysis in HER-II uses the entire scintillator volume, 266 t, since 
the above-mentioned background does not affect this energy window. 

The LER analysis uses exclusively Borexino Phase-II data collected 
between December 2011 and May 2016, in which the internal 85Kr and 
?10Bi contamination was reduced with respect to Borexino Phase-l, 
thanks to a liquid-scintillator purification campaign carried out in 
2010 and 2011. The total LER exposure is 1,291.51 days x 71.3 t. 
With the exception of 7ST] decays (Q-value, total energy released 
in the decay, about 5 MeV), the HER is above the natural, long-lived 
radioactive background, making it possible to use a larger dataset, col- 
lected between January 2008 and December 2016, for a total exposure 
of 2,062.4 days x 227.8 (266.0) t for HER-I (or HER-I1), respectively. 

The analysis proceeds in two steps: (1) the event selection, with a 
different set of cuts in the three energy regions to maximize the signal- 
to-background ratio, and (2) the extraction of the neutrino and 
residual background rates with a combined fit of distributions of global 
quantities built for the events surviving the cuts. The main event selec- 
tion criteria are conceptually similar for the LER and the HER and are 
conceived to: reject cosmic muons surviving the mountain shield*”; 
reduce the cosmogenic background (that is, radioactive elements pro- 
duced in muon-induced nuclear spallation processes); and select an 
optimal spatial region of the scintillator (the fiducial volume). More 
details on the cuts are discussed in Methods. 

Several backgrounds, listed in Table 1 and described in detail in 
Methods, survive the event selection cuts. To disentangle the neu- 
trino signal from these backgrounds, two different fitting strategies 
are adopted for the LER and the HER. The LER analysis follows a 
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Table 1 | Rates of residual backgrounds 
Background LER Rate (Bq per 100 t) 


14C(0,156 MeV, /-) [40.0 + 2.0] 

Background LER Rate (counts per day per 100 t) 
85kr (0.687 MeV, 3-) (internal) 68418 

210Bj (1.16 MeV, 3°) (internal) 17.5419 

11¢ (1.02-1.98 MeV, (3*) (internal) 268402 

210P9 (5.3 MeV, a) (internal) 260.0 + 3.0 

40K (1.460 MeV, 4) (external) 1.0+0.6 

214Bi (<1.764 MeV, >) (external) 19403 

208T| (2.614 MeV, +) (external) 3340.1 


Background HER-I 


, cosmogenics, *!4Bi (internal) 


Rate (counts per day per 227.8 t) 
[6.1°87 x 10-9] 


(a, 7) (external) 0.224 + 0.078 
208T1(5.0 MeV, 3-, >) (internal) [0.042 + 0.008] 
208T1(5.0 MeV, 3-, >) (emanated) 0.469 + 0.063 
208T1(5.0 MeV, G-,») (surface) 1.090 + 0.046 


Background HER-II 


pu, cosmogenics (internal) 


Rate (counts per day per 266.0 t) 
[3.8°34° x 1073] 
0.239 + 0.022 


Residual background is due to 3° (electrons), 3* (positrons), + (gammas), j: (muons), a (alpha 
particles) and n (neutrons). The background rates are obtained by the fit to the energy spectrum 
of collected events in the three energy regions used in this study (LER, HER-| and HER-II). We 
report in parentheses the Q-value and type of particle for each background. The rates in square 
brackets are estimated independently and are constrained in the fit. Background can be internal 
(that is, due to events uniformly distributed in the scintillator volume) or external (that is, due to 
events from sources surrounding the scintillator). 


(a, n) (external) 


multivariate approach, simultaneously fitting the energy spectrum, the 
spatial and the pulse-shape estimator distributions. In the HER-I and 
HER-II, a fit of the radial distribution of events is performed to separate 
the °B neutrino signal (uniformly distributed in the scintillator) from 
the external background. 

Some residual background rates are measured independently, when- 
ever possible, and are constrained in the fit (values between squared 
brackets in Table 1). The remaining background rates are left free to 
vary and are returned by the fit together with the neutrino rates. 

The results of the fit are exemplified in Fig. 2a, which shows the 
energy spectrum in the LER after applying the threefold coinci- 
dence method (TFC) to reduce the ''C cosmogenic background 
(see Methods); Fig. 2b shows the radial distribution of the events in 
HER-I. The different contributions from signal and background as 
determined by the fit are superimposed to data in the plots. The results 
of the fit for the untagged backgrounds are summarized in Table 1. 


Results 

The high-precision solar-neutrino results obtained in this work are 
summarized in Table 2. The second column reports the measured 
rates. In the third column, we translate these measurements into the 
corresponding solar-neutrino fluxes using the known electron and ju/T 
neutrino cross-sections”’ and the flavour composition calculated 
according to the MSW-LMA paradigm (mass and mixing parameters 
from ref. !°). The fourth column shows the theoretical fluxes predicted 
by the SSM under the HZ and LZ assumptions’®. 

In the LER multivariate fit, performed to extract the pp, pep and 
7Be neutrino rates, we first constrain the CNO neutrino interaction 
rate to the value predicted by the HZ-SSM assuming the MSW-LMA 
scenario (4.92 + 0.55 counts per day per 100 t)!819 then, separately, to 
the LZ-SSM predictions (3.52 + 0.37 counts per day per 100 t). Only 
the pep neutrino rate is slightly influenced by this constraint and thus 
two results for it are reported. In both cases, the absence of the pep 
reaction in the Sun is rejected with >5o significance, enough to defin- 
itively claim discovery of solar pep neutrinos. The contribution of 8B 
neutrinos in the LER is very small and its rate was constrained to the 


ARTICLE 


Nn, 
100 200 300 400 500 600 700 800 900 
ret Leer ete ey ee ee ee ee ey ee ee 
10 — 4c —"c 
c pp —210B9 __ ---. Pile-up 
2 — 210Bj ---- External background 
sah "Be — ®kr 
. E — Total fit: P = 0.7 
x 
> 107 
xe) 
0 
ion 
£2 107 
Cc 
(3) 
= 
if 
103 


Energy (keV) 

109 SET ee a Eee err Pre 
E — Model é| 
[ ——— Neutron capture 8B 7 

sl. ee 208T]: bulk 

10° ~~. 208T|: emanation = 
E Sess 2087]: surface al 

10 


mf TTT 
jt Lint 


Events per 1,494 days x 227tx0.10m & 


10-1 
0 o5 1 145 2 25 3 35 4 #45 5 


Radius (m) 
Fig. 2 | Results of the fit used to extract the neutrino signal. 
Distributions of events after selection cuts and corresponding fits with 
neutrino and background components. a, TFC-subtracted energy 
spectrum with suppressed !'C background in LER. The horizontal upper 
scale is in units of Np, that is, the total number of photons collected for 
each event. b, Radial distribution of events in HER-I. 


value obtained from the HER analysis. Statistical uncertainties are eval- 
uated by profiling the likelihood using Wilks’s approximation, whose 
adequacy in this case is confirmed by Monte Carlo simulations. The 7Be 
solar-neutrino flux is determined with a total uncertainty of 2.7%, a fac- 
tor of 1.8 improvement with respect to our previous result” and a factor 
of two smaller than the theoretical uncertainty. The pp interaction rate 
is consistent with our previous result”® and has an uncertainty of 9.5%. 
Fits were performed with several hundred configurations, yielding 
results whose spread is incorporated in the systematic uncertainties 
(see Methods for more details). 

The 8B solar-neutrino flux derived from our measured rate in the 
entire HER is (5.68" 9:19 93) x 10°cm ~*~‘, consistent with our pre- 
vious result?* and with the high-precision determination by 
SuperKamiokande*! and SNO*. The equivalent-flavour stable °B flux, 
that is, the flux obtained attributing the measured rate entirely to elec- 
tron neutrinos, is (2.57°9'1g'9 99) x 10° cm ~*s~!. The uncertainty in 
the 8B rate determination is 8%, a more than twofold improvement on 
our previous measurement”, 

The similarity between the electron recoil spectrum induced by 
CNO neutrinos and the 7!°Bi 8-decay spectrum makes it impossible 
to disentangle the two contributions with the spectral fit. For this rea- 
son, we only provide an upper limit on the CNO neutrino interaction 
rate. To do so, we also place an indirect constraint on pep neutrinos by 
exploiting the theoretically well known pp and pep flux ratio. Using 
values predicted by the HZ-SSM'° and including the effect of MSW- 
LMA oscillations’’, the ratio of pp and pep neutrino interaction rates is 
47.8 + 0.8. Using the ratio predicted by the LZ-SSM, 47.5 + 0.8, yields 
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Table 2 | Borexino experimental solar-neutrino results 


Solar neutrino Rate (counts per day per 100 t) 


Flux (em~? s~!) 


Flux-SSM predictions (cm~? s~+) 


pp 13441079, 

7Be 48.341.1195 

pep (HZ) 2.4340.36°935 

pep (LZ) 2.6540.3675:33 

SBuER 0.13676:313 +8603 (5.77%) 
SByER II 0.087*8-080 + 8-008 (5.56* 
SBHer 0.22379-61348-908 (5.68* 
CNO <8.1 (95% C.L) 

hep <0.002 (90% C.L.) 


(6.140. 5*83) x 101° 
(4.99+0.1148-98 
(1.2740.1978-8) x 108 


(1.39+0.1979:98) x 108 


0.39+0.03 6 
0410.03) x 10 


<7.9 x 10° (95% C.L.) 


<2.2 x 10° (90% C.L) 


5.98(1.0+0.006) x 10!° 
6.03(1.0+0.005) x 101° 


4.93(1.040.06) x 10° 
4.50(1.040.06) x 10° 


1.44(1.040.01) x 108 
1.46(1.0+0.009) x 108 
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Measured neutrino rates (second column): for pp, ’Be, pep and CNO neutrinos we quote the total counts without any threshold; for 8B and hep neutrinos we quote the counts above the corresponding 
analysis threshold. Neutrino fluxes (third column) are obtained from the measured rates assuming the MSW-LMA oscillation parameters?!°, standard neutrino-electron cross-sections?’ and a density of 
electrons in the scintillator of (3.307 +0.003) x 102! electrons per 100 t. All fluxes are integral values without any threshold. The result for pep neutrinos depends on whether we assume HZ or LZ SSM 
predictions to constrain the CNO neutrino flux. The last column shows the fluxes predicted by the SSM for the HZ or LZ hypotheses!8. 


identical results. We obtain an upper limit of <8.1 counts per day per 
100 t (95% C.L.) for the CNO neutrino interaction rate, in agreement 
with the Borexino sensitivity to CNO studied with Monte Carlo. 

For completeness, we also perform a search for the hep neutrinos, 
emitted by the proton capture reaction of *He (Fig. 1). The expected 
flux is more than two orders of magnitude smaller than that of ®B neu- 
trinos. Despite their higher end-point energy, this signal in Borexino 
is extremely small and covered by background, particularly cosmo- 
genic Be decays (Q = 11.5 MeV, 3°, T= 19.9 s) and 8B neutrinos. 
We perform a dedicated analysis on the whole dataset (0.8 kt yr) and 
in the energy region 11-20 MeV we find 10 + 3 events, consistent 
with the expected background. We obtain an upper limit for the hep 
neutrino flux of 2.2 x 10° cm~” s~! (90% C.L.) to be compared with 
the expected flux 7.98 x 10? cm~*s~! (8.25 x 10° cm? s~!) assuming 
the HZ (LZ) SSM. 


Discussion and outlook 

The measurements reported in this work represent a complete study of 
the solar pp chain and of its different terminations by means of neutrino 
detection in a single detector and with a uniform data analysis proce- 
dure. These measurements can be used either to test the MSW-LMA 
paradigm assuming SSM flux predictions or, alternatively, to probe our 
understanding of solar physics assuming the validity of the neutrino 
oscillation mechanism. 

The interaction rates of pp, Be, pep and 8B neutrinos reported 
in Table 2 can be used to infer the electron neutrino survival 
probability at different energies. Assuming the HZ-SSM fluxes'® 
and standard neutrino-electron cross-sections”’, we obtain the electron 
neutrino survival probabilities for each solar-neutrino component: 
P.e(pp, 0.267 MeV) = 0.57 + 0.09, Pro(7Be, 0.862 MeV) = 0.53 + 0.05, 
and P,.(pep, 1.44 MeV) = 0.43 + 0.11. The quoted errors include the 
uncertainties on the SSM solar-neutrino flux predictions. The ®B elec- 
tron neutrino survival probability is calculated in each HER range 
following the procedure described in ref. 74. We obtain P..(8Buer, 
8.1 MeV) = 0.37 + 0.08, Pec(Burry 7-4 MeV) = 0.39 + 0.09, and 
P.e(@Burrtp 9.7 MeV) = 0.35 + 0.09. These results are summarized 
in Fig. 3. For non-monoenergetic components, that is, pp and °B neu- 
trinos, the P.. value is quoted for the average energy of neutrinos that 
produce scattered electrons in the given energy range. 

Borexino provides the most precise measurement of the P,, in the 
LER, where flavour conversion is vacuum-dominated. At higher energy, 
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where flavour conversion is dominated by matter effects in the Sun, 
the Borexino results are in agreement with the high-precision meas- 
urements performed by SuperKamiokande*! and SNO*. Borexino is 
the only experiment that can simultaneously test neutrino flavour con- 
version both in the vacuum and in the matter-dominated regime. We 
performed a likelihood ratio test to compare our data with the MSW- 
LMA and the vacuum-LMA predictions (pink and grey bands in Fig. 3, 
respectively). Our data disfavour the vacuum-LMA hypothesis at 98.2% 
C.L. (see Methods). Overall, the results are in excellent agreement with 
the expectations from the MSW-LMA paradigm with the oscillation 
parameters indicated in ref. 1°. 

Since solar neutrinos are detected on Earth only about 8 min after 
being produced, they provide a real-time picture of the core of the Sun. 
In particular, the neutrino fluxes determined experimentally can be 
used to derive the total power generated by nuclear reactions in the 
Sun’s core*?. By using exclusively the new Borexino results reported in 
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Fig. 3 | Electron neutrino survival probability P,, as a function of 
neutrino energy. The pink band is the +1o prediction of MSW-LMA 
with oscillation parameters determined from ref. '. The grey band is the 


vacuum-LMA case with oscillation parameters determined from refs **°?. 


Data points represent the Borexino results for pp (red), “Be (blue), pep 
(cyan) and §B (green for the HER range, and grey for the separate HER-I 
and HER-II sub-ranges), assuming HZ-SSM. 8B and pp data points are set 
at the mean energy of neutrinos that produce scattered electrons above the 
detection threshold. The error bars include experimental and theoretical 
uncertainties. 
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Fig. 4 | Borexino results and analysis in the ®(’Be)-®(°B) space. 
Borexino results for Be and 8B neutrino fluxes (green point and shaded 
area). Allowed contours in the &(’Be)-®(*B) space are obtained by 
combining these new results with all solar and KamLAND data in a global 
analysis, and leaving free the oscillation parameters 6). and Am,) (grey 
ellipse, marked as GLOBAL). The theoretical prediction for the low- 
metallicity (LZ) (blue) and the high-metallicity (HZ) (red) Standard Solar 
Models (SSM)'* are also shown. The fit returns the following oscillation 
parameters: tan?0,) = 0.47 + 0.03 and Am; = (7.5 x 10-5) + 0.03, in 
agreement with what is reported in ref. !? (sin?0); is fixed to 0.0217; ref. 1°). 
All contours correspond to 68.27% CLL. 


Table 2, we find L = (3.89103) x 10° erg s},in agreement with the 
luminosity calculated using the well measured photon output****, 
L = (3.846 + 0.015) x 10% erg s~!. This confirms experimentally the 
nuclear origin of the solar power with the best precision obtained by a 
single solar-neutrino experiment. Considering that it takes around 10° 
years for radiation to flow from the energy-producing region to the 
surface of the Sun, this comparison proves also that the Sun has been 
in thermodynamic equilibrium over this timescale. 

Furthermore, we derive the ratio Ry between the *He-*He and the 
3He—*He fusion rates, which quantifies the relative intensity of the two 
primary terminations of the pp chain (pp-II and pp-I; see Fig. 1), a 
critical probe of solar fusion. Neglecting the 8B neutrino contribution, 
this ratio can be extracted from the measured pp and Be neutrino 
fluxes by the relation*®, Ry = 26(’Be)/[®(pp) — &(7Be)]. We find 
Ri = 0.178") 055, in agreement with the most up-to-date predicted 
values of Ry11 = 0.180 + 0.011 (HZ) and 0.161 + 0.010 (LZ)!®. 

Finally, the Borexino measurements can be used to test the predic- 
tions of SSMs with different metallicity. Indeed, the assumed metal- 
licity determines the opacity of solar plasma and, as a consequence, 
regulates the central temperature of the Sun and the branching ratios 
of the different pp-chain terminations. To perform this test, we use 
only the results for 7Be and °B neutrinos, whose fluxes are very dif- 
ferent in the HZ- and the LZ-SSM theoretical predictions (differences 
of 9% and 18%, respectively). Figure 4 shows the results of Borexino 
(green-shaded ellipse), together with the predictions for the HZ- and 
LZ-SSMs!8 (red- and blue-shaded ellipses, respectively). Note that the 
errors in the Borexino measurements are in both cases smaller than the 
theoretical uncertainties. The theoretical error budget is dominated by 
uncertainties on the astrophysical factor S34 of the “He + *He reaction, 
on the opacity of the Sun, and on the astrophysical factor S)7 of the 
p + Be reaction as discussed in ref. '*. 

The Borexino results are compatible with the temperature pro- 
files predicted by both HZ- and LZ-SSMs. However, the ’Be and 8B 
solar-neutrino fluxes measured by Borexino provide an interesting hint 
in favour of the HZ-SSM prediction. A frequentist hypothesis test based 
on a likelihood-ratio test statistics (HZ versus LZ) was performed by 
computing the probability distribution functions with a Monte Carlo 
approach. Assuming HZ to be true, our data disfavour LZ at 96.6% C.L. 
This constraint is slightly stronger than our sensitivity (the median 
sensitivity is at 94.2% C.L.). A Bayesian hypothesis test” yields a Bayes 
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factor of 4.9, confirming a mild preference for HZ (see Methods for 
more details on both the frequentist and Bayesian studies). 

For the sake of completeness, we performed a global fit including the 
results presented in this work together with all the other solar + 
KamLAND data. Following the procedure described in ref. 27 we leave 
the oscillation parameters 0), Am,, and the ’Be and 8B neutrino fluxes 
free to vary in the fit. Figure 4 shows the allowed regions in the ®(’Be)- 
(8B) space determined from this global analysis. The oscillation 
parameters returned by the fit are consistent with the ones obtained in 
ref. '°. It is clear from the output of this global fit that when the Borexino 
results are combined with those of all other solar-neutrino experiments, 
the small hint towards HZ further weakens. 

In summary, we have reported simultaneous measurements of solar 
neutrinos from all the reactions belonging to the pp nuclear fusion chain. 
This study confirms the nuclear origin of the solar power and provides 
the most complete real-time insight into the core of our Sun so far. 
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METHODS 


The Borexino detector. Borexino is a large liquid-scintillator experiment located 
deep underground at the Laboratori Nazionali del Gran Sasso in Italy. Borexino is 
designed to achieve extremely low background conditions. The active core of the 
detector consists of about 300 t of pseudocumene (1,2,4-trimethylbenzene) doped 
with 1.5 g per litre of PPO (2,5-diphenyloxazole) and contained in a spherical 
nylon inner vessel (radius R = 4.25 m). The scintillator is surrounded by a non- 
scintillating pseudocumene-based buffer liquid which serves as a shield against 
external radioactivity (see Extended Data Fig. 1). The scintillator fluorescence light 
is collected by 2,212 photomultiplier tubes mounted on the Stainless Steel Sphere 
(radius R = 6.9 m). The entire detector is enclosed in a domed, cylindrical tank 
filled with high-purity water, equipped with 208 photomultiplier tubes, which 
provides extra shielding against external radioactivity (photons and neutrons), 
and also serves as an active water Cherenkov veto against residual cosmic muons. 
A detailed description of the Borexino detector is found in ref. *°. 

The SSM and the solar metallicity controversy. The SSM is a solution of the 
stellar evolution equations for stars of one solar mass, calibrated to match pres- 
ent-day, measured surface properties of the Sun. A fundamental assumption is that 
the Sun was initially chemically homogeneous and that during its 4.56-Gyr-long 
evolution, it has modified its chemical composition solely due to nuclear reactions 
and elemental diffusion. The model calibration is done by adjusting the mixing 
length parameter and the initial chemical composition in order to reproduce the 
observed solar luminosity, radius, and current surface composition. As a result of 
this procedure, the SSM has no free parameters and completely determines the 
mechanical and thermal properties of the Sun. 

The SSM predicts that most of the solar energy (>99%) is produced by the 
so-called pp chain (see Fig. 1) that fuses hydrogen into “He: the chain is initiated 
by the pp fusion reaction and, to a minor extent, by the alternative three-body pep 
process. These reactions produce deuterons, which are efficiently converted into 
3He by the subsequent deuteron-proton reaction. The pp chain mostly terminates 
with the He + *He — “He + 2p reaction (pp-I termination). In the late 1950s, the 
cross-section for the competing *He + “He — ’Be + +y reaction was discovered*” to 
be about one thousand times larger than previously thought, causing the branching 
ratios of the pp-II and pp-III terminations to be non-negligible. An alternative pro- 
cess is the so-called CNO cycle, a closed-loop nuclear reaction in which °C, !4N, 
and '6O nuclei catalyse hydrogen fusion into ‘He. The CNO cycle is a subdominant 
energy-producing mechanism in stars like the Sun or lighter, but is believed to be 
the dominant fusion mechanism in heavier or older stars. 

For each “He nucleus produced in the Sun, two electron-flavour neutrinos are 
emitted. Neutrinos free-stream across the solar plasma and reach the Earth trav- 
elling close to the speed of light in about 8 min, resulting in a total flux of about 
6.5 x 10'°cm~?s”!. The solar-neutrino spectrum depends on the branching ratios 
of the different pp chain terminations and on the relative intensity of the pp chain 
and the CNO cycle. A large percentage (about 90%) of the neutrinos emitted by the 
Sun are produced in the primary pp fusion reaction (producing pp neutrinos). Most 
of the remaining 10% of the solar-neutrino flux is emitted in the electron capture 
reaction on “Be (producing “Be neutrinos), which appears along the pp-II branch 
of the chain. Smaller contributions come from pep fusion (the pep neutrinos) and 
from B decays in the pp-III branch (producing *B neutrinos). Neutrinos from 
proton capture of *He (hep neutrinos) are expected to be emitted with negligi- 
ble probability (10-7) and are beyond current detection sensitivity. The predicted 
energy spectrum of all neutrinos emitted along the pp chain, including spectral 
shapes and intensity before neutrino oscillations are shown in Fig. 1. 

The predictions of the SSM have been tested by solar-neutrino experiments 
and by helioseismology (which determines the properties of the solar interior by 
studying the propagation of seismic waves at the Sun’s surface). However, important 
questions about the Sun still call for an answer. For example, the solar metallicity— 
the abundance of elements heavier than He—is poorly understood, although it is 
a fundamental input when constructing SSMs and a relevant parameter in astro- 
physics, since almost all determinations of elemental abundances in astronomical 
objects rely upon the solar composition. Recent determinations of the solar surface 
composition‘! suggest that the solar metallicity might be lower than previously 
assumed***, SSMs that incorporate these lower abundances, however, agree less 
well with helioseismic data: this is often referred to as the solar metallicity problem. 

Solar-neutrino measurements provide fundamental clues for the solution of 
this puzzle. Indeed, the opacity of the solar plasma is strongly influenced by the 
presence of heavy elements. Since opacity determines the efficiency of radiative 
energy transfer, the metal content of solar matter affects the temperature profile 
of the Sun. As a consequence, metallicity determines the branching ratios for the 
various terminations of the pp chain, as well as the relative intensity of the pp chain 
with respect to the CNO cycle. A precise determination of the solar-neutrino fluxes 
comprising both the pp chain and the CNO cycle is thus a direct, robust way to set- 
tle the solar metallicity controversy. In the main text we compare our experimental 
results with predictions of HZ and LZ SSMs'8. 
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Neutrino oscillations and the MSW effect. For many years, the experimental 
results on solar neutrinos have been at odds with the predictions of the SSM: all 
the experiments observed a large deficit of neutrinos with respect to expectations. 
This 30-year-long controversy was settled only in 2002 by the experiment SNO", 
which proved unambiguously that the solution to the ‘solar-neutrino problem’ was 
not to be searched for in solar physics, but in neutrino physics, namely, in the 
quantum mechanics phenomenon of flavour oscillations!”. Through this mecha- 
nism, solar neutrinos, which are born in the Sun as electron neutrinos, v7, have a 
non-zero probability to transform into neutrinos with a different flavour (either 
u, or v,) during propagation and are therefore less likely to be detected on Earth. 
For oscillations to occur, two conditions must be met: (1) mass and flavour eigen- 
states for neutrinos must not coincide, which implies the existence of a non-trivial 
mixing matrix which transforms one into the other; and (2) the mass of at least 
one neutrino must be different from 0. The relevant parameters for solar-neutrino 
oscillations are the mixing angle 6,2 and the squared mass difference between the 
mass eigenstates, mostly contributing to v., that is, Am ). The probability of flavour 
conversion is enhanced when neutrinos cross the dense solar medium, because of 
coherent forward scattering on electrons. This mechanism is referred to as the 
Mikheyey-Smirnov—Wolfenstein (MSW) matter effect!*!4 and for the specific 
values of Am;, and of the Sun density profile it fully describes solar neutrinos with 
energies greater than about 5 MeV. For energies below 1 MeV, the vacuum oscil- 
lation mechanism dominates, whereas in the intermediate-energy region, a smooth 
transition occurs. Figure 3 shows the survival probability P.. for ve produced in the 
Sun as a function of the neutrino energy (pink curve) for the oscillation parameters 
obtained by a global fit to all solar-neutrino, reactor and accelerator experiments”. 
The values of Am) (about 7.5 x 1075 eV) and of 0; (about 33°) correspond to 
the so-called Large Mixing Angle solution (LMA) of the solar-neutrino problem. 
Event selection and residual backgrounds. The analysis starts with data selection 
aimed at reducing the rate of background events. The selection criteria, concep- 
tually similar for the LER and HER, are conceived to: (1) reject cosmic muons 
penetrating the mountain shield; (2) reduce the cosmogenic background, that is, 
the decays of short-lived radioactive elements produced in muon-induced nuclear 
spallation processes in the detector; and (3) select a fiducial volume of the scintil- 
lator, optimized separately for the LER and HER-I/II analyses. 

Rejection of muons is achieved by combining the external Cherenkov veto 
information with a pulse shape analysis of the scintillator signals, and displays 
an overall efficiency*’ of 99.992%. The reduction of cosmogenic background is 
obtained by excluding events collected during a given time At following every 
muon crossing the scintillator. 

For the LER, a short muon veto time At = 300 ms is enough to efficiently 
suppress most relevant cosmogenic isotopes. An exception is |'C (Q = 0.96 MeV, 
3*, 7 = 29.4 min), which is produced in situ by muon spallation, and has a mean 
lifetime that greatly exceeds the short muon veto time cut. !'C has a fairly constant 
concentration in the scintillator (around 30 counts per day per 100 t) determined 
by the equilibrium between its production and decay rate and cannot be reduced by 
any purification procedure. It is therefore one of the most important backgrounds 
and must be treated with a specific analysis (see next paragraph). 

For the HER, the rejection of cosmogenic background requires a larger 
time window of At = 6.5 s to suppress PB He, °C, °Li, 8B, “He and *Li decays. 
Furthermore, for the HER analysis a 2-ms veto is applied after muons that cross 
the buffer liquid only. This veto aims at rejecting 4.95-MeV ~-rays following the 
capture of cosmogenic neutrons on ?C nuclei; an additional cut is applied around 
the capture position of cosmogenic neutrons, when this happens inside the scin- 
tillator, to remove !°C (Q =3.6 MeV, 3*, 7 = 27.85). 

Both in the LER and in the HER, 7““Bi and ?!“Po from the 7**U natural decay 
chain are removed by exploiting the space-time correlation of their fast 6 + a 
delayed coincidence decays. 

The analysis in the LER and HER-I/II use different fiducial volumes. The LER 
fiducial volume focuses on suppressing external +-rays from “°K, 7/4Bi and ?°T1 
contained in materials surrounding the scintillator. It consists of the central 71.3 t of 
scintillator, selected by applying a radial cut (R < 2.8 m) and a cut along the vertical 
axis (—1.8m < z < 2.2 m). The HER is above the energy of the aforementioned 
+-rays. The analysis in HER-I only requires a z < 2.5 m cut to suppress background 
events related to a small pinhole in the nylon vessel that causes scintillating fluid 
to leak into the region surrounding it. The total HER-I target mass is 227.8 t. The 
analysis in HER-II uses the entire scintillator volume of 266 t. More details on the 
selection criteria can be found in refs #475”, 

After the selection cuts described above, some residual background remains 
both in the LER and in the HER. The LER residual background is detailed in 
Table 1, and is mostly due to traces of radioactive isotopes contaminating the 
scintillator—that is, C, ?!°Po (either from *!°Pb decay or out of equilibrium), 
85Ky, 210Bi (from 7!°Pb) and pile-up of uncorrelated events. A small contribu- 
tion to the LER rate also comes from external *°*T], 7!4Bi and “°K ~\-rays emerg- 
ing from materials surrounding the scintillator. In the LER fit, the C rate is 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


quantified and constrained using an independent sample of events acquired with- 
out any trigger threshold”*. The contribution of pile-up, dominated by simulta- 
neous '4C decays at different detector positions, is treated using the following two 
methods described in refs °°: in one case, we construct the pile-up spectrum 
starting from real or Monte Carlo datasets; in the other, we convolve all spectral 
components with a randomly acquired spectrum (that is, with events acquired 
with a solicited, external trigger). 

The residual backgrounds affecting the HER-I and HER-II are also listed in 
Table 1. Some of the internal events (that is, events uniformly distributed in the 
scintillator volume) are due to muons, cosmogenic isotopes, and 7!“Bi decays sur- 
viving the cuts. The total contribution of these backgrounds has been evaluated 
separately for the HER-I and the HER-II, following the procedure described in 
ref. 24, and constrained in the fit. In addition, the presence of untagged Be 
(Q =11.5 MeV, 3°, 7 = 19.9 s) is estimated by adopting a technique based on a 
multivariate fit, which includes the energy spectrum and the time profile of events 
with respect to the preceding muon, and is found to be compatible with zero. 
The HER-1 is also affected by internal ?°°T] decays, which come from the residual 
232Th contamination of the liquid scintillator. In the fit, this rate is constrained to 
the value obtained by counting the 7!”Bi-?!?Po 3 + « fast delayed coincidences. 
External 7°°T] contamination contributes to the HER-I with two distinct compo- 
nents: one from contamination directly on the inner vessel surface, and another 
from decays of nuclei that have recoiled off the inner vessel into the liquid scintil- 
lator or originated from the volatile progenitor of °°Tl, 7°Rn, which has emanated 
out of the nylon. The rates of both components are left free to vary in the radial 
fit. Finally, HER-I and HER-I/ are also polluted by ~-rays following the capture of 
radiogenic neutrons produced via (a, 1) or spontaneous fission reactions of 238, 
235U and ?*Th in the Stainless Steel Sphere and photomultiplier tubes. This rate is 
also a free parameter of the fit. 

The !'C background. The !!C background is not removed by the short veto cut 
after muons. To disentangle its contribution from the neutrino signal, we use the 
TEC method”?””, which exploits the time and space correlation between muons, 
the neutrons they produce in combination with '!C, and the subsequent !'C 
decays. With this method we divide the events passing the selection cuts in two 
complementary datasets: one is depleted in ''C (TFC-subtracted) and preserves 
(64.28 + 0.01)% of the total exposure; the other contains (92 + 4)% of the !4C 
(TFC-tagged). The energy spectra of these two datasets are fitted simultaneously 
in the multivariate fit (see next paragraph). The residual ''C (positron) background 
in the TFC-subtracted spectrum is further disentangled from electron-like events 
by including in the multivariate fit the distribution of a pulse-shape discrimination 
variable”>’. It is in fact observed that the time distribution of scintillation photons 
slightly differs between electron and positron events, for the following reasons: (1) 
positron produces ortho-positronium half of the time, which delays the annihilation 
by around 3 ns (ref. *°); (2) the positron energy deposition occurs in multiple sites 
within the detector, owing to the production of annihilation \-rays. These effects 
tend to delay and extend the time distribution of the scintillator pulse for positrons 
with respect to electron events, a handle we exploit for |'C background rejection. 
Fitting procedure for extraction of solar-neutrino rates. To disentangle the 
neutrino signal rates from the residual background, we apply different fitting 
strategies for the LER and the HER. For LER, we adopt a multivariate approach 
and simultaneously fit the TFC-subtracted and the TFC-tagged energy spectra, 
the spatial distribution, and the distribution of the pulse-shape discrimination 
variable. The spatial distribution is crucial to separate the residual external back- 
ground component, while the pulse-shape estimator is optimized to separate 
positrons from electrons, which is key to disentangling ''C from the other fit 
species (see above). The reference radial distributions for external and internal 
events used in the multivariate fit are built with a comprehensive Geant4-based 
Monte Carlo simulation, carefully tuned and validated with calibration data”®*”’. 
The spectral shapes of signal and background components used in the multivar- 
iate fit of the LER are also obtained from simulations. In addition, the fit of the 
energy spectra is performed using analytical spectral functions*>’’, where the 
nonlinearity of the energy scale (due, for example, to ionization quenching and 
Cherenkov light emission) and the spatially non-uniform detector response are 
included via nuisance parameters, some of which are left free to vary in the fit. 
The reference positron pulse-shape distribution used in the LER multivariate fit 
is based on events selected with the TFC method described above, tuned to obtain 
a nearly pure sample of !!C events. The reference electron pulse-shape distribu- 
tion is obtained from simulations and checked on data using electron-like events 
isolated via the /Bi-?!4Po coincidences. 

In the HER-I and HER-II, the analysis is based on a fit to the radial distribution 
of the events to separate the *B neutrino signal (uniformly distributed in the scin- 
tillator) from the external background components. Like the LER fit, the reference 
radial distributions for external and internal events used in the HER fit are built 
with Geant4-based Monte Carlo simulations. For more details on the fit to extract 
the neutrino signal see ref. 7”. 


Systematic uncertainties in the analysis. The detector energy response and uni- 
formity has been carefully studied by means of an extensive calibration campaign 
which was carried out in 2009*8. The calibration data were used to tune the input 
parameters of the Borexino Monte Carlo package, a custom Geant4-based code*” 
that can simulate all processes following the interaction of a particle in the detector, 
including all known characteristics of the apparatus”. After tuning, the agreement 
between Monte Carlo and calibration data is very good for both the LER and the 
HER: for the energies relevant to the LER analysis, the overall uncertainty is below 
1%, while for the HER analysis, it is around 1.9%. 

In spite of this remarkable understanding of the detector response throughout 
the scintillator volume and in a large energy range, an extensive study of possible 
sources of systematic errors has been performed both for the LER and for the 
HER. The results of these studies are summarized in Extended Data Tables 1 and 2, 
respectively. 

Concerning the analysis in the LER, the main contribution to the systematic 
error comes from the fit model, that is, possible residual inaccuracies in the mod- 
elling of the detector response (energy scale, uniformity of the energy response, 
pulse-shape discrimination shape) and uncertainties in the theoretical energy 
spectra used in the fit. These systematic effects have been estimated by means 
of a Monte Carlo method: an ensemble of 100,000 datasets are simulated from a 
family of probability density functions, which includes deformations caused by 
the inaccuracies under study. The magnitude of the deformations was chosen to 
be within the range allowed by the available calibration data. These data are then 
fitted following the same procedure used for real data and differences in the results 
are quoted as systematics (see first line in Extended Data Table 1). 

The second source of systematics is related to the fit method, that is, whether 
the reference probability density functions used in the fit are entirely derived from 
Monte Carlo simulations or analytically. Further systematic effects arise from the 
choice of the energy estimator, from the details of the implementation of the pile-up 
of uncorrelated events, from using different fit energy ranges and binning, from the 
inclusion of an independent constraint on **Kr obtained from its sub-dominant 
delayed coincidence decay (branching ratio 0.43%), and from the estimation of 
the target fiducial mass. This last uncertainty is determined with calibration data, 
by using sources deployed in known positons throughout the detector volume. 

Concerning the HER analysis, the most important systematic uncertainties arise 
from the determination of the target mass, from the energy scale, and from the 
z-cut applied in the HER-I range (see Extended Data Table 2). 

The target mass uncertainty is related to the fact that the amount of scintillator 
contained in the inner vessel is slowly decreasing (by less than 0.5 m? per year), due 
to a small pinhole in the nylon membrane. We monitor the evolution of the scin- 
tillator mass on a week-by-week basis, by studying the inner vessel shape, which is 
obtained from the spatial distribution of its surface contamination. This method 
gives an average total mass of 266 t with an error of about 2%. 

The impact of the uncertainty of the energy scale on the number of events falling 
in the HER-I and HER-II energy window has been evaluated with a full Monte 
Carlo simulation and has been included in the systematic error (see second line 
of Extended Data Table 2). 

As mentioned in the main text, the HER-I analysis requires a cut on the vertical 
coordinate to remove background events owing to a small pinhole in the nylon 
vessel that causes the scintillator to leak into the buffer liquid. To estimate possible 
systematics associated to this cut, the HER-I analysis was performed with a modi- 
fied z-cut, +0.5 m around the chosen value (2.5 m). Differences in the results have 
been included as systematic error. 

Frequentist hypothesis test of MSW versus vacuum oscillations. Borexino 
provides results on the electron neutrino survival probability (P,.) in the entire 
solar-neutrino energy range. We are therefore able to perform a statistical study 
to compare the compatibility of our measurement with two different hypotheses: 
the standard oscillation scenario, MSW-LMA, and the vacuum-LMA scenario, in 
which matter effects are not present (and which is taken as our null hypothesis). 

The survival probability PMS“ '™ in the MSW-LMA scenario depends not 
only on the oscillation parameters 12, 6)3 and Am), valid in vacuum, but also on 
the neutrino-energy-dependent potential characterizing the interaction of neutri- 
nos with the dense solar core. It can be expressed as follows: 


z 1 
puewlMe = 5008013 (1 + cos26,M'cos26,,) 
where 
cos26,, — 
cos20," 2 5 B = 
(cos20,, — 3)°+sin°26 
12 12 
and 
gu 2./2 G,c0s’6,,nE, 
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where @™ is the mixing angle in matter, Gy is the Fermi coupling constant and n, is 
the density of electrons in the matter. Using the current set of oscillation parameters 
and errors derived in ref. !°, and following the procedure described in ref. 27 we 
obtain the pink band in Fig. 3. 

If matter effects were not present, the survival probability for solar neutrinos 
would be approximated by the expression Py": 


1 
pyc cos! | 1- 5 sin 20, | +sin'O,; 


which is independent of the neutrino energy E,. Taking for 613 and 6), the values 
and errors measured by reactor neutrino experiments in refs **?, the survival 
probability P""™ as a function of E,, corresponds to the grey band in Fig. 3. 


We performed a frequentist analysis, in which we adopt a test statistics t based on 
the ratio between the likelihood L obtained assuming MSW-LMA and vacuum-LMA: 


t = — 2log[L(MSW) /L(vacuum)] = \?(MSW) - x?(vacuum) 


The probability distribution of t is built with a Monte Carlo method: we ran- 
domly generate thousands of values of P,, in the MSW-LMA hypothesis (by sam- 
pling the pink curve in Fig. 3 and including both theoretical and experimental 
uncertainties) and for each set of data we estimate t and build its distribution (red 
curve on the left in Extended Data Fig. 2). In the same way, we simulate thousands 
of Pee values in the vacuum-LMA hypothesis and we build the corresponding t 
distribution (blue curve on the right in Extended Data Fig. 2). 

The actual Borexino results for P.. for pp, “Be, pep and 8B gives a value of 

tgx = —4.16 (indicated as a dashed line in Extended Data Fig. 2), which allows us 
to disfavour the vacuum-LMA hypothesis with a P value of 0.018 (integral of the 
small tail of the blue curve to the left of tgx), corresponding to a C.L. of 98.2%. For 
more details on the choice of the test statistics see ref. “”. 
Frequentist and Bayesian hypothesis test of the HZ versus LZ models. The com- 
bination of the Borexino measurement on 8B and “Be fluxes provides an interesting 
hint in favour of the solar temperature profile predicted by the HZ-SSM. This was 
obtained by performing both a frequentist and a Bayesian hypothesis test. 

In the frequentist analysis, we used a test statistics t based on the ratio between 
the likelihood obtained assuming HZ and LZ: 


t = — 2log[L(HZ)/L(LZ)] = x°(HZ) — x°(LZ) 


The probability distribution of t is built with a Monte Carlo method 
(full Neumann construction of the confidence intervals): we randomly generate 
thousands of fake “Be-*B results in the HZ hypothesis (sampling a distribution 
that includes both theoretical and experimental errors) and for each set of data 
we estimate t (red distribution on the left in Extended Data Fig. 3). In the same 
way, we simulate thousands of fake ’Be-*B results in the LZ hypothesis and we 


ARTICLE 


build the corresponding ¢ distribution (in blue on the right in Extended Data 
Fig. 3). 

The value of ¢ corresponding to the actual Borexino result for 7Be-*B is shown 
in the plot as the dotted line at tgx = —3.49, relatively far from the maximum of 
the LZ probability distribution (blue curve). This allows us to disfavour the LZ 
hypothesis with a P value of 0.034 (integral of the small tail of the blue curve to the 
left of tgx), corresponding to a C.L. of 96.6%. The result is slightly better than the 
median P value expected (0.058), which corresponds to a median significance of 
94.2% C.L. For more details on the choice of the test statistics see ref. ”°. 

In the Bayesian analysis we constructed two models, one for the HZ and the 
other for the LZ hypothesis, in which the free parameters are the fluxes of §B and 
of ’Be. The model predictions are used as prior probability distributions. The like- 
lihood is constructed as the sum of two Gaussian measurements, one for the flux 
of 8B and the other for the flux of “Be. 

We compare the two models assuming that they have the same probability a 
priori (50% for the HZ hypothesis and 50% for LZ hypothesis). Like the frequentist 
analysis, the data show a mild preference for HZ with respect to LZ. The odds 
are 5:1 or, equivalently, the Bayes factor is 4.9. For more details on the Bayesian 
method see ref. *”. 


Data availability 

The datasets generated during the current study are freely available in the repos- 
itory https://bxopen.Ings.infn.it/. Additional information is available from the 
Borexino Collaboration spokesperson (spokesperson-borex@Ings.infn.it) upon 
reasonable request. 
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Extended Data Fig. 1 | The Borexino detector. Schematic view of the photomultiplier tubes (PMTs) are mounted; the outermost nylon vessel, 
‘onion-like’ structure of the Borexino apparatus. From outside to inside: which serves as a barrier against radon; the innermost nylon vessel, which 
the external water tank; the Stainless Steel Sphere, where about 2,200 contains 300 t of liquid scintillator, the active detection medium 
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Extended Data Fig. 2 | Frequentist hypothesis test of MSW-LMA and §B energies) in the MSW-LMA hypothesis (red curve on the left) and 
versus vacuum-LMA. The probability distribution of the test statistics tis in the vacuum-LMA hypothesis (blue curve on the right). The dotted black 
obtained by simulating thousands of sets of P,. values (at the pp, “Be, pep line corresponds to the results of Borexino discussed in the main text. 
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Extended Data Fig. 3 | Frequentist hypothesis test for LZ and HZ. The on the left) and in the LZ hypothesis (blue curve on the right). The dotted 
probability distribution of the test statistics t is obtained by simulating black line corresponds to the results of Borexino discussed in the main 


thousands of fake sets of SB-Be values in the HZ hypothesis (red curve text. 
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Extended Data Table 1 | LER analysis systematics 


pp neutrinos 7Be neutrinos | pep neutrinos 


Source of uncertainty 


Fit models (see text) -4.5 +0.5 -1.0 +0.2 -6.8 +2.8 
Fit method (analytical/Monte Carlo) -1.2 +1.2 -0.2 +0.2 -4.0 +4.0 
Choice of the energy estimator -2.5 +2.5 | -0.1 +0.1 -2.4 +2.4 
Pile-up modeling -2.5 +0.5 0 0 0 0 
Fit range and binning -3.0 +3.0 -0.1 +0.1 -1.0 +1.0 
Inclusion of the *Kr constraint -2.2 +2.2 0 +0.4 -3.2 0 
Live time -0.05 +0.05 | -0.05 +0.05 | -0.05 +0.05 
Scintillator density -0.05 +0.05 | -0.05 +0.05 | -0.05 +0.05 
Fiducial volume -1.1 +0.6 | -1.1 +0.6 -l.1  +0.6 
Total systematics (%) -7.1 +4.7 -1.5 +0.8 -9.0 +5.6 


Relevant sources of systematic uncertainties and their contributions to the measured neutrino interaction rates for the LER analysis. 
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Extended Data Table 2 | HER analysis systematics 


HER-I HER-I HER (tot) 


Source of uncertainty 


Target mass -2.0 +2.0 | -2.0 +2.0 -2.0 +2.0 

Energy scale -0.5 +0.5 | -4.9 +4.9 -1.7 +17 

z-cut -0.7 +0.7 0 0 -0.4 +0.4 

Live time -0.05 +0.05 | -0.05 +0.05 | -0.05 +0.05 

Scintillator density -0.05  +0.05 | -0.05 +0.05 | -0.05 +0.05 

Total systematics (%) -2.2 +2.2 | -5.3 +5.3 -2.7 42.7 
Relevant sources of systematic uncertainties and their contributions to the measured neutrino interaction rates for the HER analyses. 
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Large meteorite impact structures on the terrestrial bodies of the Solar System contain pronounced topographic rings, 
which emerged from uplifted target (crustal) rocks within minutes of impact. To flow rapidly over large distances, these 
target rocks must have weakened drastically, but they subsequently regained sufficient strength to build and sustain 
topographic rings. The mechanisms of rock deformation that accomplish such extreme change in mechanical behaviour 
during cratering are largely unknown and have been debated for decades. Recent drilling of the approximately 200-km- 
diameter Chicxulub impact structure in Mexico has produced a record of brittle and viscous deformation within its peak - 
ring rocks. Here we show how catastrophic rock weakening upon impact is followed by an increase in rock strength that 
culminated in the formation of the peak ring during cratering. The observations point to quasi-continuous rock flow 
and hence acoustic fluidization as the dominant physical process controlling initial cratering, followed by increasingly 


localized faulting. 


Large hypervelocity impact structures show a distinct size-morphology 
progression’ (Fig. 1), which depends on the gravity and target rock type 
of the impacted body. In this regard, the study of internal topographic 
rings—so-called peak rings’—are of particular importance in under- 
standing the formation of peak-ring impact structures (Fig. 1b) and 
multi-ring impact basins (Fig. 1c)’. As crater diameter increases beyond 
the maximum size of a bowl-shaped crater, the depth-to-diameter ratio 
of the crater decreases. On Earth, peak-ring crater formation (Fig. 2, 
Supplementary Information) takes place in minutes! and implies 
extreme deformation rates accompanying large displacements. Peak- 
ring craters can be a few hundred kilometres in diameter, yet merely a 
few kilometres deep, with the peak rings greatly elevated above crater 
floors. To explain this topographic characteristic, peak-ring crater 
formation requires drastic mechanical weakening of the target rocks. 
Weakening is thought to be caused by a decrease in the angle of internal 
friction and cohesion and results in large-scale fluid-like behaviour of 
target rock during part of the cratering process*”. Towards the end of 
the cratering process, however, rock strength needs to be sufficiently 
high to form and sustain topographically elevated peak rings. 

A number of mechanisms for target-rock weakening have been pro- 
posed. These include impact-induced fracturing and fragmentation of 
the target rocks*"'®, wholesale thermal softening by shock heating’, fault 
weakening’” by shear heating’ or other processes, and acoustic fluidi- 
zation'®”°. In this last process, short-wavelength, high-frequency pres- 
sure oscillations around the lithostatic pressure temporarily reduce the 
overburden pressure and, thus, friction between fractured target rocks. 

Because direct observations are extremely limited, the exact mecha- 
nisms and duration of target rock weakening during large impact cra- 
tering are unknown. In particular, unequivocal physical evidence for 
acoustic fluidization or fault weakening in large impact craters remains 
to be identified. Large extra-terrestrial craters can only be analysed 
by remote sensing, which provides little or no subsurface structural 


information. With estimated original diameters between 200 km and 
250 km, Vredefort (South Africa), Sudbury (Canada) and Chicxulub 
(Mexico), known as ‘the big three”!, are the largest impact structures 
known on Earth. Vredefort and Sudbury, however, are eroded to variable 
depths” of about 10 km and about 5 km, respectively, and so are 
largely missing the upper and most displaced target rocks (Fig. 2d). 
Chicxulub is the sole near-pristine, large impact structure with a topo- 
graphic peak ring on Earth (Fig. 1d)?*-*, but post-impact sedimentary 
strata hundreds of metres thick have buried the impact structure, 
hindering direct access to the target rocks. Recent drilling by Expedition 
364 of the International Ocean Discovery Program (IODP) with the 
International Continental Scientific Drilling Program (ICDP)**”8 into 
the target rocks that constitute the peak ring at Chicxulub has now pro- 
vided unprecedented insight into target rock deformation, weakening 
mechanisms and peak-ring formation in large-scale impact cratering. 


Structural characteristics of target rock 

A total of 829 m of core was recovered from Expedition 364 bore- 
hole M0077A (Fig. 1d), starting at 506 m below sea floor (m.b.s.f.)7*78. 
The recovered core includes 112 m of post-impact pelagic carbonate 
rock, followed by 130 m of impact melt rock and suevite, and 587 m 
of pervasively shocked target rock. The target rock consists of coarse- 
grained, alkali-feldspar-rich granitoid rock hosting uniformly oriented, 
pre-impact mafic and felsic sheet intrusions (Extended Data Fig. 1). 
At depths between 1,220 and 1,316 m.b.s.f,, the target rock is mingled 
with impact melt rock on the decimetre to metre scale. Elsewhere in 
the target rock, impact melt rock is rather sparse. The mean density 
(2.41 g cm~3) and mean P-wave velocity (4.1 km s~!) of the target 
rock are considerably lower than those of typical felsic basement rocks 
(>2.6 gcm~* and >5.5 km s7')**”8, These petrophysical characteristics 
indicate substantial mechanical modification of the rock, notably in 
terms of increased porosity”. 
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Fig. 1 | Typical impact structures on the Moon (http://quickmap.lroc. 
asu.edu) and the geophysical characteristics of the Chicxulub 

impact structure. Topographically elevated areas in a—c are highlighted 
in magenta. a, Tycho (diameter 85 km) is a central-peak crater. 

b, Schrédinger* (diameter 312 km) is a peak-ring impact structure. 

c, Orientale (diameter 930 km) is a multi-ring impact basin. d, Combined 


The post-impact carbonate rock is unstrained. Pre-impact defor- 
mation, however, of the granitoid target rock is evident through the 
sporadic presence of weak shape-preferred orientations of alkali-feld- 
spar, plagioclase, quartz and biotite. The grain-shape alignment of these 
minerals formed under high-grade metamorphic conditions, as indi- 
cated by viscous deformation of feldspars and quartz”®. Crystal-plastic 
strain cannot account for the reduced density and P-wave velocity of 
the target rock. Consequently, impact processes, including the damage 
caused by the passage of the shock wave, and deformation during peak- 
ring formation, must have caused the anomalous geophysical properties 
of the target rock”’. 

Observed shock-induced structures in the target rock consist of shatter 
cones, microscopic planar deformation features and planar fractures 
in quartz and feldspars, as well as kinked biotite*®. Severe structural 
target rock modification is evident by brittle and viscous deformation 
structures, including: (1) pervasive, irregular grain-scale fractures, (2) 
zones of cataclasite and ultra-cataclasite, (3) striated shear faults, (4) 
crenulated mineral foliations, and (5) brittle-ductile band structures 
(Figs. 3 and 4). The formation of (1) to (3) substantially increases the 
volume of deformed rock and thus accounts for the observed reduction 
in density and P-wave velocity”. 

The spatial distribution of macroscopic deformation structures 
indicates highly heterogeneous deformation in the target rock (Fig. 3). 
Microscopic inspection of the granitoid target rock reveals the perva- 
sive presence of intra- and inter-granular dilation fractures displaying 
jigsaw fragment geometry (Fig. 4a). Zones of strongly comminuted 
material separate displaced mineral fragments (Fig. 4b). These cata- 
clasite zones range in thickness from millimetres to centimetres 
(Fig. 4a—c, g) and indicate local differential shearing during cataclastic 
deformation. Locally, cataclasite zones grade into, or are truncated by, 
flow-foliated ultra-cataclasite, characterized by alternating quartz- and 
feldspar-rich layers (Fig. 4d). Crystal-plastic distortion of plagioclase 
(Fig. 4e) and quartz (Fig. 4f) indicate that the target rock accumulated 
some plastic strain before pervasive fracturing and cataclastic flow. 
Zones of (ultra-)cataclasite and crude mineral foliations, defined by 
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M0077A 


Bouguer gravity and seismic line A’’ of the Chicxulub impact structure. 
Offshore seismic data’’ indicate that the Chicxulub peak ring roughly 
correlates with a gravity low. The location of drill hole M0077A on the 
peak ring is indicated. Half-arrows indicate the sense of displacement on 
faults. 


the shape-preferred orientation of biotite and coarse layers of quartz 
and feldspars, are sporadically kinked (Fig. 4g, h). In summary, cata- 
clastic deformation displays variable intensity throughout the cored 
target rock, which is evident by its localization and variable degree of 
comminution. 

A total of 602 shear faults, with well-defined slip lineations, were 
recorded in the granitoid target rock (Fig. 3), with the total number 
of shear faults being vastly higher. By contrast, only 13 shear faults 
with slip lineations were identified in the post-impact carbonate rock 
and consist of a few millimetre-long calcite fibres (Fig. 4i), typical of 
seismic stick-slip faulting. Slip lineations in the target rock, however, 
form pronounced ridges and grooves of strongly comminuted host 
rock material (Fig. 4j). Displacements on these faults may amount to 
several decimetres”®, Although the post-impact carbonate rock shows 
a weak tectonic overprint, it is evident that the granitoid target rocks 
underwent catastrophic and pervasive shear faulting. 

At 1,220 to 1,316 m.b.s.f., the target rock is strongly distorted 
and brecciated, and fragments of it are marginally resorbed and 
found in melt rock (Fig. 5a, b). Conversely, zones of brecciated tar- 
get rock host elongated, and frequently wispy, melt-rock fragments, 
reminiscent of suevite (Fig. 5c, d). Where in contact with target rock 
fragments, the melt rock underwent large ductile strains, as is clear 
from the highly stretched granitoid fragments contained in the melt 
rock (Fig. 5e). Overall, the melt rock is spatially associated with the 
highest-strained target rocks, indicated by breccia of thicknesses of 
decimetres to metres. The presence of exotic fragments (Fig. 5f)— 
consisting of gneiss, mafic igneous rock and various mylonites—in 
the melt rock excludes an in situ frictional melt origin for the melt 
rock. Breccia zones are substantially thicker and show a larger range 
in sizes and shapes of fragments than cataclasite and ultra-cataclasite 
zones in target rock outside this particular depth interval. The 
differences in thickness and fragment size between these breccia and 
the cataclasite zones indicate different fragmentation mechanisms 
and/or fragmentation at different times during the cratering process. 
Finally, the spatial density of ductile band structures is maximal within 
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Fig. 2 | Modelled formation of the Chicxulub impact structure. 

The mechanism is based on numerical modelling of peak-ring crater 
formation**?+4, A grid of tracer particles is shown to highlight the 
sub-crater deformation. Dark red area of crust in each panel tracks the 
material that eventually forms the peak ring. T denotes time in seconds 
after impact. Red half-arrows indicate the direction of major shear 
displacements relative to adjacent material. a, Undisturbed configuration 
of model lithosphere before impact. b, Cratering starts by shock-wave- 


this depth interval (Fig. 3). Brittle-ductile band structures occur pre- 
dominantly in mechanically and thermally weakened target and melt 
rock and form ductile shear zones (Fig. 5f), shear bands with C-S 
fabric geometry (Fig. 5g)?! and crenulated mineral fabrics (Fig. 4h). 


Chronology of deformation mechanisms 

Most importantly, it is possible to determine the relative timing of the 
various deformation mechanisms. Zones of (ultra-)cataclasite trun- 
cate the jigsaw fragment geometry of pervasively fractured target rock 
(Fig. 4a, b). Shear faults, in turn, consistently offset cataclasite and 
ultra-cataclasite zones (Figs. 4c and 5a, b). Target rock fragments in 
melt rock are sporadically striated and host cataclasite zones”*; whereas 
melt rock matrices are devoid of shear faults. Cataclasite and melt rock 
are found in tension fractures (Fig. 5h), which, to some extent, formed 
from shear faults. Brittle-ductile band structures displace zones of cata- 
clasite, crenulated foliation surfaces and the contacts of target rock with 
cataclasite and melt rock (Figs. 4g, h and 5f, g). In summary, pervasive 
fracturing of target rock was followed, respectively, by formation of 
(ultra-)cataclasite zones, shear faulting, emplacement of cataclasite 
and impact melt into dilatant fractures and formation of ductile band 
structures. 


Deformation mechanisms and cratering stages 

During the various cratering stages, deformation kinematics and 
states of stress of the target rock differ profoundly (Fig. 2). Therefore, 
distinct deformation mechanisms recognized in the target rock may 
well relate to individual cratering stages denoted in terms of time 
after impact. Shock and decompression causes irreversible plastic 
deformation and imparts to the shocked rocks a divergent outward 
velocity field, which forms the transient cavity. This velocity field 
causes wall-parallel extension and perpendicular shortening of the 
target rock (Fig. 2b). Rock deformation at upper-crustal pressures 
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induced, crustal-scale excavation of a bowl-shaped transient cavity. 

c, Gravitational instability of the transient cavity causes uplift of the crater 
centre and concomitant inward slumping of the cavity wall. d, Collapse 
and radial outward displacement of uplifted material over inward-slumped 
cavity wall segments followed by gravitational settling of the peak ring 
(inset) characterize the terminal phase of modelled crater modification. 
White lines indicate the approximate current erosion levels of the Sudbury 
and Vredefort impact structures. 


and depths, which is where peak-ring materials are derived from, is 
accommodated by fracturing. We therefore attribute pervasive frac- 
turing, which preceded the other deformation mechanisms, to shock 
loading, decompression and transient cavity growth (time after impact 
T<30s). 

After the transient cavity forms (Fig. 2b), gravitational collapse 
modifies the crater shape until the final crater morphology is reached 
(Supplementary Information). During initial collapse, the peak-ring 
material motion transitions from outward and divergent excavation 
flow to inward and convergent rock flow towards the crater centre. 
This inward movement leads to the incorporation of peak-ring material 
onto the flank of a central uplift (Fig. 2c). During this stage of cratering, 
peak-ring materials experience several distinct stress states (Extended 
Data Fig. 2). Planar zones of cataclasite and (ultra-)cataclasite are plau- 
sible candidates for accommodating the deformation of pre-fractured 
target rock during this cratering stage (20 s< T< 150s). 

During build-up of the central uplift (20 s < T < 100 s), the pres- 
sure on the peak-ring material increases (Extended Data Fig. 2). This 
increase inevitably closes asperities within the fractured rock and thus 
increases the internal friction of the target rock and normal stresses 
on faults. The central uplift eventually over-heightens and becomes 
gravitationally unstable, causing downwards and radial-outward col- 
lapse (160 s < T< 300s). In this motion, collapsed material piles up to 
form the peak ring, which is thrust over the inwardly slumped tran- 
sient cavity rim (Fig. 2d, Supplementary Information). Collectively, 
the increased pressure, combined with the reversal of the material dis- 
placement field as the central uplift transitions from motion upwards 
to outwards and downwards during collapse accounts for the observed 
transition from localized cataclastic flow to shear faulting during this 
stage of cratering. 

As the melt rock occurrences within the target rock are devoid of 
shear faults, melt emplacement must occur at the end of peak-ring 
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Fig. 3 | Spatial distribution of major lithological units and deformation 
structures in target rock of M0077A drill core. n is number of 
observations. We note the strong spatial correlation of increased numbers 


formation (250 s < T <600 s). Subsequent deformation is evident from 
the ductile band structures displacing contacts between the target and 
melt rock, zones of cataclasite and mineral foliations. Band orientation, 
the sense of displaced layers and fabric asymmetry, as displayed by 
sigmoidal foliation planes and cataclasite zone boundaries, consistently 
indicate band formation through normal faults (Figs. 4g, h and 5f, g). 
Respective vertical shortening and horizontal extension is consistent 
with gravitational spreading of the topographically elevated peak ring 
and signifies the final stage of crater modification (inset in Fig. 2d). 


Weakening mechanisms 

The recognition of distinct deformation mechanisms corresponding to 
the various stages of the cratering process is of fundamental importance 
in comprehending the mechanics of large-scale impact cratering. Initial 
pervasive grain-scale fracturing causes a profound loss of cohesion in 
target rocks at the onset of, and during, transient cavity growth. During 
cavity modification, strain is localized progressively through forma- 
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of (ultra-)cataclasite zones, crenulated foliations and ductile band 
structures below 1,220 m.b.s.f. 


tion of cataclasite zones, ultra-cataclasite zones, shear faults, and finally 
deformation on fault zones with impact-melt-bearing fault breccias. 
Progressive strain localization is evidence of the incremental regaining 
of shear and cohesive strength in the target rock, as crater modification 
proceeds. It has been proposed that crater collapse is facilitated by the 
self-lubrication of faults by frictional melts!®. We did not, however, 
uncover any evidence for friction-generated melt rock in the recovered 
target rock from the peak ring at Chicxulub. Hence, dynamic weaken- 
ing of faults, if important, appears to require a mechanism other than 
shear heating. 

Shock compression and dilation during initial impact caused whole- 
sale intra-crystalline damage (Fig. 4e, f). Thereafter, dynamic fracturing 
induced by the passage of the shock and rarefaction waves and transient 
cavity growth led to loss in cohesion and shear strength. The presence 
of pervasively fractured target rock with preserved microscopic jigsaw 
fragment patterns and uniform orientation of pre-impact dykes 
(Extended Data Fig. 1) indicate that target rock above 1,220 m.b.s.f. 
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Fig. 4 | Deformation structures in target rock at site M0077A. Arrow 
indicates the direction of the top of the drill core. Half-arrows indicate the 
sense of displacement on discontinuities. a, Photomicrograph in plane- 
polarized light showing pervasive cataclasite of granitoid target rock (core 
122-3, 820 m.b.s.f.). b, Line drawing of a showing alkali-feldspar (kf) 
displaced on cataclasite zone (c). c, Cataclasite zones displaced on shear 
faults (core 301-1, 1,326.45-1,326.57 m.b.s.f.). d, Flow-foliated ultra- 
cataclasite (core 215-2, 1,065.85-1,065.94 m.b.s.f.). e, Photomicrograph in 


behaved largely as a structurally coherent rock mass. The implication 
of small displacements across the entire rock mass is consistent with 
macroscopic deformation of an acoustically fluidized rock mass'®”?, 
Structural observations from the peak-ring target rocks of Chicxulub 
are generally consistent with acoustic fluidization as the dominant 
weakening mechanism and offer insight for the refinement of future 
impact simulations. Acoustic fluidization entails target rock blocks 
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cross-polarized light showing distorted twin lamellae in plagioclase (core 
129-1, 831.38-831.40 m.b.s.f.). f, Photomicrograph in cross-polarized 
light showing distorted quartz with planar deformation features (core 
129-1, 831.38-831.40 m.b.s.f.). g, Cataclasite zone segmented by normal 
faults (core 172-2, 956.41-956.45 m.b.s.f.). h, Crenulated layering in 
granitoid rock (core 122-1, 817.61-817.66 m.b.s.f.). i, Striated shear fault 
in carbonate rock. j, Striated shear fault in granitoid target rock (core 
154-1, 894.19 m.b.s.f.). 


undergoing pressure oscillations around the ambient lithostatic 
stress*”!0, During pressure lows, blocks have reduced normal stresses 
between them, drastically reducing frictional resistance at block bound- 
aries during periodic rock flow. During pressure highs, blocks are com- 
pressed, locally increasing the frictional resistance of the deforming 
rock mass. Cataclasite zones seem likely to be where the sheared block 
boundaries serve as contact strain zones during oscillation of target 
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Fig. 5 | Images illustrating rock types found between 1,220 and 1,316 
m.b.s.f. (g, highly distorted granitoid rock; m, impact melt rock; b, fault 
breccia; e, exotic fragments.) a, Line scan of core 265-2 (1,216.36-1,217.45 
m.b.s.f.), showing highly distorted and brecciated target rock mingled 
within melt rock. We note halos of mingled melt rock and fault breccia 

at the margins of granitoid rocks as well as shear faults (half-arrow) 
displacing thin zones of ultra-cataclasite. b, Line drawing of a. c, Line 

scan of core 285-1 (1,277.24-1,278.25 m.b.s.f.), displaying mingling of 
impact melt rock and fault breccia, notably near the granitoid fragment. 
We note melt rock fragments within fault breccia. d, Line drawing of c. 


rock blocks. Continued cataclasis, resulting in flow-foliated ultra- 
cataclasite, heralds an increase in shear strain of the rock mass and 
waning acoustic fluidization. While in motion, continued comminution 
in (ultra-)cataclasite zones may generate additional acoustic energy and 
prolong cataclastic flow”. 

A critical parameter in the acoustic fluidization model is the dom- 
inant wavelength of pressure vibrations’, which controls both the 
viscosity of the acoustically fluidized rock mass and the timescale for 
the decay of vibrations. The ‘block model of acoustic fluidization is 
employed in most Chicxulub-scale impact simulations*”?4, such as the 
one reproduced in Fig. 2. The block model supposes that the subcrater 
rock mass is dominated by blocks of a characteristic size that oscillate 
within a surrounding mass of breccia with a single vibrational wave- 
length (and period) that is directly proportional to the block size*”. The 
block model parameters employed in Chicxulub impact simulations 
imply a block size of about 100-500 m (depending on the assumed 
acoustic energy dissipation factor Q) and an oscillation frequency ofa 
few hertz. This prediction is consistent with the entire approximately 
450-m granite sequence above the imbricate thrust zone representing 
a single ‘block (Fig. 3). 
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e, Melt rock in contact with fault breccia. We note the gradient in contact 
strain, evident from the stretched target rock fragments in melt rock (core 
303-3, 1,334.24-1,334.35 m.b.s.f.). f, Ductile shear zone in mingled impact 
melt rock and fault breccia containing exotic fragments (core 289-1, 
1,289.75-1,289.87 m.b.s.f.). g, C-S fabric geometry in granitoid indicated 
by displaced planar mineral fabric in granitoid target rock (half arrows) 
amounting to vertical shortening and horizontal extension (white arrows) 
(core 273-2, 1,241.26-1,241.31 m.b.s.f.). h, Cataclasite entrained in dilatant 
fracture (core 262-1, 1,207.45-1,207.56 m.b.s.f.). 


On the other hand, if the cataclasite zones observed in the Chicxulub 
peak-ring drill core represent oscillating-block boundaries as we pro- 
pose, their average spacing (Fig. 3) of about 3.5 m (2.3 m including 
ultra-cataclasite zones) would imply a much smaller block size, shorter 
dominant vibrational wavelength and higher vibrational frequency'?”°. 
This would imply rapid evolution of the acoustic wave field during 
collapse of the crater, which is not predicted by the current block model 
implementation. High-frequency vibrations sustained for the dura- 
tion of crater collapse, however, could be explained by the efficient 
regeneration of acoustic energy during the cratering process, which is 
neglected in the block model. Effective regeneration of vibrations in a 
rapidly shearing rock mass is consistent with findings from discrete- 
element models of acoustic fluidization in landslides**. Alternatively, 
the acoustic wave field may evolve by progressive lengthening of the 
dominant vibrational wavelength during cratering as higher-frequency 
vibrations dissipate sooner. In this case, the effective block size could 
increase during crater formation from a few metres at the beginning of 
modification, when the first cataclasite zones are likely to have formed 
(20 s< T< 60s), to a few hundred metres by the end of peak-ring 
emplacement (T'< 600 s). 


© 2018 Springer Nature Limited. All rights reserved. 


A progressive waning of the acoustic wavefield in which slip events, 
facilitated by negative pressure excursions, become less frequent and 
more widely spaced is consistent with the temporal evolution of defor- 
mation observed in the drill core. This evolution suggests a progression 
from distributed, small-displacement deformation along closely spaced 
faults early in the cratering process to more localized, larger-displacement 
deformation along widely spaced slip surfaces later. Acoustic fluidiza- 
tion is, therefore, interpreted to halt at the onset of shear faulting, as 
target rock blocks cease to oscillate and the bulk rock mass regains 
internal friction and, thus, shear strength. Whether this cessation of 
acoustic fluidization occurs during the final emplacement of the peak 
ring (as suggested by current numerical simulations; Fig. 2d) or ear- 
lier, during the formation of the central uplift, is unclear. In the latter 
scenario, the outward collapse of the central uplift and thrusting of 
peak-ring rocks onto the transient cavity rim would have occurred 
after the rocks regained most of their large-scale static strength. In this 
case, the late stages of collapse could have been facilitated by large faults, 
lubricated by entrained impact melt. 


Peak-ring formation 

Modelling suggests that the target rock forming the peak ring resided 
at a depth” of about 10 km, before impact, and was entrained into a 
central uplift before being thrust outward over inward slumped tran- 
sient cavity wall segments (Fig. 2). From the modelled cratering flow 
(Supplementary Information), it is conceivable that individual target 
rock blocks may over-thrust portions of impact melt, notably where 
the peak ring develops. Impact melt may then become sandwiched 
between quasi-coherent target rock masses. Hence, impact melt in large 
craters may be present not only as ponded liquids at the surface, but 
also as melt bodies or sheets entrained and trapped in target rock thrust 
zones at depth. 

Structural and lithological characteristics of the rocks at depths 
between 1,220 and 1,316 m.b.s.f. are consistent with impact melt 
entrained in a prominent imbricate thrust zone (Fig. 3). Respective 
characteristics include: (1) the concentration of high strains in target 
rock and melt rock (Fig. 5a, e), (2) the strongly distorted target rock 
slivers mingled with melt rock and breccia, interpreted as fault breccia 
(Fig. 5a-d, f), (3) the occurrence of melt rock fragments in fault breccia 
(Fig. 5c, d, f), and (4) fragment lithologies not present in the adjacent 
target rock”®, Given that in situ frictional melting is excluded for the origin 
of the melt rock, formation of this rock by shock-induced melting 
and subsequent entrainment during peak-ring formation appears to 
be the more plausible explanation. Specifically, we propose that the 
target rock mass above 1,220 m.b.s.f. over-thrust and buried the impact 
melt overlying the deeper target rock, which is now found below 1,316 
m.b.s.f. Impact melt rock in contact with brecciated target rock displays 
large ductile strains (Fig. 5e) and indicates rapid cooling (quenching) 
and solidification of the impact melt during thrusting. In summary, 
imbricate thrusting (stacking) of target rock masses** contributed to 
the high topography of the peak ring. A prerequisite for thrusting is 
the regaining of shear strength in the target rock by the time of the 
formation of peak-ring topography. 


Consequences of dynamic weakening 

Examination of the deformation mechanisms of the target rocks 
underlying the peak ring at Chicxulub has provided unprecedented 
evidence for the physical mechanisms responsible for weakening and 
the regained strength of target rock during large-scale impact cratering. 
Results are strongly supportive of the dynamic collapse model (Fig. 2, 
Supplementary Information) of peak-ring formation and of acoustic 
fluidization as the dominant mechanism driving crater modifica- 
tion. The transition in deformation style from distributed cataclastic 
flow to localized shear-faulting and the progressive increase in fault 
spacing illuminates the waning of acoustic fluidization and the target 
regaining sufficient strength to support the topography of the peak 
ring. Dynamic weakening of faults or regeneration of acoustic energy 
may have an important role in this final phase of peak-ring formation. 
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Incorporating this insight into future numerical impact simulations 
will aid in the design of higher-fidelity models of large-scale impact 
cratering. 

In particular, we regard (ultra-)cataclasite zones, serving as contact 
strain zones of oscillating target rock blocks, as the physical manifesta- 
tion of pressure fluctuations. If so, the estimated average size of coherent 
target rock blocks within the Chicxulub peak ring is one to two orders 
of magnitudes smaller than observed in the central uplifts of smaller 
terrestrial complex craters*>-*”. This may imply efficient regeneration 
of pressure fluctuations during transient cavity collapse and modifica- 
tion or a growth in vibrational wavelength as the wavefield evolves. In 
either case, central peaks of smaller impact structures may be preserved 
because fluidization ceased early in the gravitational collapse process. 
By contrast, peak rings in peak-ring craters and multi-ring basins form 
because acoustic fluidization is sustained through the formation and 
collapse of an overheightened central uplift. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
https://doi.org/10. 1038/s41586-018-0607-z. 


Received: 28 February 2018; Accepted: 15 August 2018; 
Published online 24 October 2018. 


1. Croft, S. K. The modification stage of basin formation: conditions of ring 
formation. Geochim. Cosmochim. Acta 12A, 227-257 (1981). 

2. Grieve, R. A. F., Robertson, P. B. & Dence, M. R. Constraints on the formation of 
ring impact structures, based on terrestrial data. Geochim. Cosmochim. Acta 
12A, 37-57 (1981). 

3. Neumann, G. A. et al. Lunar impact basins revealed by gravity recovery 
and interior laboratory measurements. Sci. Adv. 1,e1500852 
(2015). 

4. Ivanov, B. A. Numerical modelling of the largest terrestrial meteorite craters. Sol. 
Syst. Res. 39, 381-409 (2005). 

5. Melosh, H. J. & lvanov, B. A. Impact crater collapse. Annu. Rev. Earth Planet. Sci. 
27, 385-415 (1999). 

6. O'Keefe, J.D. & Ahrens, T. J. Planetary cratering mechanics. J. Geophys. Res. 98, 
17011-17028 (1993). 

7. Winnemann, K. & Ivanov, B. A. Numerical modelling of the impact crater 
depth-diameter dependence in an acoustically fluidized target. Planet. Space 
Sci. 51, 831-845 (2003). 

8. Grady, D. E. & Kipp, M. E. in Fracture Mechanics of Rock (ed. Atkinson, B. K.) 
429-475 (Academic Press, London, 1987). 

9. Fujiwara, A. Kamimoto, G. & Tsukamoto, A. Destruction of basaltic bodies by 

high-velocity impact. Icarus 31, 277-288 (1977). 

0. Ahrens, T. J. & Rubin, A. M. Impact-induced tensional failure in rock. J. Geophys. 

Res. 98, 1185-1203 (1993). 

1. Buhl, E. et al. Particle size distribution and strain rate attenuation in 

hypervelocity impact and shock recovery experiments. J. Struct. Geol. 56, 20-33 

(2013). 

2. Collins, G.S. Numerical simulations of impact crater formation with dilatancy. 

J. Geophys. Res. 119, 2600-2619 (2014). 

3. Melosh, H. J., Ryan, E. V. & Asphaug, E. Dynamic fragmentation in impacts: 

hydrocode simulation of laboratory impacts. J. Geophys. Res. 97, 14,735- 
14,759 (1992). 

4. Kenkmann, T. Folding within seconds. Geology 30, 231-234 (2002). 

5. Kenkmann, T. Dike formation, cataclastic flow, and rock fluidization during 
impact cratering: an example from the Upheaval Dome structure, Utah. Earth 
Planet. Sci. Lett. 214, 43-58 (2003). 

6. Collins, G. S., Melosh, H. J. & lvanov, B. A. Modeling damage and deformation in 
impact simulations. Meteorit. Planet. Sci. 39, 217-231 (2004). 

7. Senft, L. E. & Stewart, S. T. Dynamic fault weakening and the formation of large 
impact craters. Earth Planet. Sci. Lett. 287, 471-482 (2009). 

8. Spray, J. G. Superfaults. Geology 25, 579-582 (1997). 

9. Melosh, H. J. Acoustic fluidization: a new geological process? J. Geophys. Res. 
84, 7513-7520 (1979). 

20. Melosh, H. J. Dynamical weakening of faults by acoustic fluidization. Nature 

379, 601-606 (1996). 

21. Grieve, R.A. F. & Therriault, A. Vredefort, Sudbury, Chicxulub: three of a kind? 
Annu. Rev. Earth Planet. Sci. 28, 305-338 (2000). 

22. Grieve, R.A. F., Reimold, W. U., Morgan, J., Riller, U. & Pilkington, M. Observations 
and interpretations at Vredefort, Sudbury and Chicxulub: towards a composite 
kinematic model of terrestrial impact basin formation. Meteorit. Planet. Sci. 43, 
855-882 (2008). 

23. Collins, G.S., Melosh, H. J., Morgan, J. V. & Warner, M. R. Hydrocode simulations 
of Chicxulub crater collapse and peak-ring formation. Icarus 157, 24-33 
(2002). 

24. Morgan, J. V. et al. The formation of peak rings in large impact craters. Science 
354, 878-882 (2016). 


25 OCTOBER 2018 | VOL 562 | NATURE | 517 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


25. Morgan, J. V., Warner, M. R., Collins, G. S., Melosh, H. J. & Christeson, G. L. 


Peak-ring formation in large impact craters: geophysical constraints from 
Chicxulub. Earth Planet. Sci. Lett. 183, 347-354 (2000). 

26. Morgan, J. V. et al. Full waveform tomographic images of the peak ring at the 
Chicxulub impact crater. J. Geophys. Res. 116, B06303 (2011). 

27. Gulick, S. P. S. et al. Importance of pre-impact crustal structure for 


the asymmetry of the Chicxulub impact crater. Nat. Geosci. 1, 131-135 

(2008). 

28. Morgan, J. V. et al. Chicxulub: drilling the K-Pg impact crater. In Proceedings of 

the International Ocean Discovery Program 364 https://doi.org/10.14379/iodp. 

proc.364.2017 (International Ocean Discovery Program, College Station, 

2017). 

29. Christeson, G. L. et al. Extraordinary rocks from the peak ring of the Chicxulub 

impact crater: P-wave velocity, density, and porosity measurements from IODP/ 

CDP Expedition 364. Earth Planet. Sci. Lett. 495, 1-11 (2018). 

30. Petit, J. P. Criteria for the sense of movement on fault surfaces in brittle rocks. 

J. Struct. Geol. 9, 597-608 (1987). 

31. Berthé, D., Choukroune, P. & Jegouzo, P. Orthogneiss, mylonite and non-coaxial 

deformation of granites: the example of the South Armorican Shear Zone. 

J. Struct. Geol. 1, 31-42 (1979). 

32. Ivanov, B. A. & Artemieva, N. A. in Catastrophic Events and Mass Extinctions: 
Impact and Beyond (eds C. Koeberl, C. & MacLeod, K. G.) Geological Society of 
America Special Paper 356, 619-630 (GSA, 2002). 

33. Johnson, B. C., Campbell, C. S. & Melosh, H. J. The reduction of friction in long 
runout landslides as an emergent phenomenon. J. Geophys. Res. 121, 881-889 
(2016). 

34. Kring, D.A., Kramer, G. Y., Collins, G. S., Potter, R. W. K. & Chandnani, M. 

Peak-ring structure and kinematics from a multi-disciplinary study of the 

Schrédinger impact basin. Nat. Commun. 7, 13161 (2016). 

35. Ivanov, B. A., Kocharyan, G. G. & Kostuchenko, V. N. Puchezh-Katunki impact 

crater: preliminary data on recovered core block structure. In Proc. 27th Lunar 

and Planetary Science Conf. 589-590, https://www.|pi.usra.edu/meetings/ 

psc1996/pdf/1295.pdf (1996). 

36. Kenkmann, T., Collins, G. S. & Wiinnemann, K. in /mpact Cratering: Processes and 

Products (eds Osinski, G. R. & Pierazzo, E.) 60-75 John Wiley & Sons, 

Chichester, 2013). 

37. Rae, A. S. P., Collins, G. S., Grieve, R. A. F., Osinski, G. R. & Morgan, J. V. Complex 
crater formation: insights from combining observations of shock pressure 
distribution with numerical models at the West Clearwater Lake impact 
structure. Meteorit. Planet. Sci. 52, 1330-1350 (2017). 


Acknowledgements This work was supported by the Priority Programs 

527 and 1006 of the German Science Foundation (grants Ri 916/16-1 

and PO 1815/2-1), National Science Foundation grants (OCE-1737351, 
OCE-1450528 and OCE-1736826), and Natural Environment Research 
Council (grants NE/PO11195/1 and NE/P005217/1). The Chicxulub drilling 
expedition was funded by the European Consortium for Ocean Research 
Drilling (ECORD) and the IODP as Expedition 364 with co-funding from the 
ICDP. The Yucatan State Government and Universidad Nacional Auténoma de 
México (UNAM) provided logistical support. This research used samples and 
data provided by IODP. Samples can be requested at http://web.iodp.tamu. 
edu/sdrm. We are grateful for assistance from the staff of the IODP Core 
Repository in Bremen, Germany, during the Onshore Science Party. We thank 
B. Ivanov and C. Koeberl for constructive reviews and S. Teuber for assistance 
in figure preparation. This is UTIG contribution number 3,278. 


Reviewer information Nature thanks B. lvanov and C. Koeberl for their 
contribution to the peer review of this work. 


Author contributions U.R., M.H.P,, A.S.PR., J.V.M., S.P.S.G. and R.A.F.G. conceived 
the study. All authors participated in sampling and data collection offshore 


518 | NATURE | VOL 562 | 25 OCTOBER 2018 


and/or onshore during IODP-ICDP Expedition 364, interpretation of the data 
as well as writing and/or editing of the manuscript. U.R. provided the first draft 
of the manuscript. U.R. and F.M.S. acquired structural data from line scans. J.L. 
and A.D. provided the downhole orientation data. A.S.P.R and G.S.C. performed 
and analysed the numerical models; G.S.C., A.S.P.R. and H.J.M. contributed the 
discussion on the implications for acoustic fluidization. 


Competing interests The authors declare no competing interests. 


Additional information 

Extended data is available for this paper at https://doi.org/10.1038/s41586- 
018-0607-z. 

Supplementary information is available for this paper at https://doi.org/ 
10.1038/s41586-018-0607-z. 

Reprints and permissions information is available at http://www.nature.com/ 
reprints. 

Correspondence and requests for materials should be addressed to UR. 
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. 


IODP-ICDP Expedition 364 Science Party 
Joanna V. Morgan?, Sean P. S. Gulick®”, Sophie L. Green!!, Johanna Lofi8, 

Elise Chenot!2, Gail L. Christeson®, Philippe Claeys!%, Charles S. Cockell!4, 
Marco J. L. Coolen!5, Ludovic Ferriére!®, Catalina Gebhardt!”, Kazuhisa Goto}, 
Heather Jones!9, David A. Kring, Long Xiao?°, Christopher M. Lowery®’, 
Rubén Ocampo-Torres#!, Ligia Perez-Cruz22, Annemarie E. Pickersgill23:24, 
Michael H. Poelchau2°, Auriol S. P. Rae2®, Cornelia Rasmussen®’, 

Mario Rebolledo-Vieyra2’, Ulrich Riller?, Honami Sato28, Jan Smit?29, 

Sonia M. Tikoo-Schantz2°, Naotaka Tomioka®!, Michael T. Whalen?2, 

Axel Wittmann, Kosei Yamaguchi?*35, Jaime Urrutia Fucugauchi22 

& Timothy J. Bralower!? 


British Geological Survey, The Lyell Centre, Research Avenue South, Edinburgh, UK. 
12Université de Bourgogne-CNRS, Biogeosciences Laboratory, Dijon, France. Analytical, 
Environmental and Geochemistry (AMGC), Vrije Universiteit Brussel (VUB), Brussels, Belgium. 
14S chool of Physics and Astronomy, UK Center for Astrobiology, University of Edinburgh, 
Edinburgh, UK. 5Western Australia Organic and Isotope Geochemistry Centre, School of Earth 
and Planetary Sciences, Curtin University, Bentley, Western Australia, Australia. !©Natural 
History Museum, Vienna, Austria. !7Alfred Wegener Institute Helmholtz Centre of Polar and 
Marine Research, Bremerhaven, Germany. !8International Research Institute of Disaster 
Science, Tohoku University, Sendai, Japan. '?Pennsylvania State University, University Park, PA, 
USA. 2°China University of Geosciences (Wuhan), School of Earth Sciences, Planetary Science 
Institute, Wuhan, China. 2!National Center of Scientific Research (CNRS), Groupe de Physico- 
Chimie de l’Atmosphére, Institut de Chimie et Procédés pour l’Energie, l'Environnement 

et la Santé ICPEES, Université de Strasbourg, Strasbourg, France. Instituto de Geofisica, 
Universidad Nacional Auténoma De México, México City, Mexico. ??School of Geographical 
and Earth Sciences, University of Glasgow, Glasgow, UK. 24Argon Isotope Facility, Scottish 
Universities Environmental Research Centre (SUERC), East Kilbride, UK. “°Department of 
Geology, University of Freiburg, Freiburg, Germany. 2°Department of Earth Science and 
Engineering, Imperial College London, London, UK. 2’Unidad de Ciencias del Agua, Mérida, 
Mexico. @8Japan Agency for Marine-Earth Science and Technology, Yokosuka, Japan. 22Faculty 
of Earth and Life Sciences, Amsterdam, The Netherlands. °°Earth and Planetary Sciences, 
Rutgers University—New Brunswick, Piscataway, NJ, USA. ?Japan Agency for Marine-Earth 
Science and Technology, Kochi Institute for Core Sample Research, Kochi, Japan. Department 
of Geosciences, University of Alaska Fairbanks, Fairbanks, AK, USA. %Eyring Materials Center, 
Arizona State University, Tempe, AZ, USA. °4Department of Chemistry, Tohu University, 
Funabashi, Japan. 35NASA Astrobiology Institute, Mountain View, CA, USA. 


© 2018 Springer Nature Limited. All rights reserved. 


METHODS 


Acquisition of structural data from drill core. In addition to the methods 
employed for visual appraisal as well as meso- and microstructural analyses of the 
drill core during the Onshore Science Party*’, the following analyses were con- 
ducted. On the basis of a detailed examination of drill core line-scans, the occurrence 
of cataclasite zones, ultra-cataclasite zones, crenulated foliations and ductile band 
structures was recorded with depth. Only zones of (ultra-)cataclasite displaying a 
thickness of 1 cm and larger were recorded. Distinction between the two types of 
cataclasite is based on grain size, the presence of flow foliation and the fragment-size 
distribution. Overall, ultra-cataclasite appears darker than cataclasite. Mesoscopic 
shear faults displaying slip lineations and slip sense were identified by carefully 
removing core sections from the liners. Statistical analysis of the spatial occurrence 
of the structures was conducted with Microsoft Excel (see Source Data for Fig. 3). 
Microstructural analysis. Polished thin sections of thickness 25 {um were produced 
from selected target rock samples at the Institute of Mineralogy and Petrography 
of the University of Hamburg, Germany. Microscopic inspection of thin sections 
was conducted using a Zeiss Axio Scope.A1 polarization microscope and attached 
high-resolution digital camera AxioCam MRc Rev. 3 FireWire. 
Borehole imaging of planar structures. During Expedition 364, both optical and 
acoustic borehole images of the borehole walls were acquired*®. Post-acquisition 
processing and analysis allowed manual picking of the planar structural disconti- 
nuities, corresponding to pre-impact igneous sheet intrusions, and determination 
of their orientation. Orientations have not been corrected from borehole deviation, 
which departs less than 4° from the vertical. For visualization and processing of 
borehole images, the ALT WellCAD (https://www.alt.lu/products-wellcad/) soft- 
ware package was used. For analysis of orientation of pre-impact sheet intrusions 
the software package Tectonics FP version 1.6 was used”. 
Numerical modelling. To aid interpretation of the drill core data, we reproduced 
and reprocessed the numerical simulation of the Chicxulub impact”, which was 
in turn based on previous Chicxulub impact simulations that produced a good 
match to geological and geophysical constraints*”*>?"". The impactor parameters 
of the model were: diameter 14 km, velocity 12 km Ss, density 2,630 kg mA 
vertical incidence impact angle was enforced by the cylindrical geometry of the 
two-dimensional model. A spatial resolution of 200 m was used, corresponding to 
35 cells across the impactor radius. A simplified target structure was used of 3 km 
(carbonate) cover rocks and 30 km (granite) crust overlying (dunite) mantle. The 
simulation duration was 600 s of model time. We refer to ref. 4 for a full description 
of the modelling approach, including a comprehensive list of model parameters. 
Simulations were processed to examine the motion and pressure of peak-ring 
materials (Fig. 2a—-d, Extended Data Fig. 2, Supplementary Video). Lagrangian 
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tracer particles employed in the numerical method allow the history of material 
that ends up within the peak ring to be recorded and interrogated. Ref. 74 used 
tracer particles to illustrate the peak pressure and provenance of the peak-ring 
materials, as well as its motion during crater formation. Here, we identified a sub- 
set of 100 tracer particles within the same peak-ring material, initially located 
within a square (2 km x 2 km) cross-section at a depth of 10 km and a radius of 
16 km (see Supplementary Information, T= 0). The Supplementary Video shows 
the motion of these tracers during cratering in both the fixed simulation reference 
frame (main image) and in a Lagrangian reference frame, centred on the average 
location of the 100 tracers (inset). The inset image gives a qualitative sense of the 
internal deformation of the peak-ring materials and highlights the deformation 
kinematics of peak-ring material during cratering. 

Additionally, we analysed the pressure recorded by each tracer (circles) within the 
same volume, as well as the average pressure (solid line), as a function of time during 
the simulation (Extended Data Fig. 2). After the brief passage of the shock wave 
(P> 10 GPa; T< 5), the pressure in the peak-ring materials rises from 10-20 MPa 
to 50-100 MPa between about 100 s and about 250 s, before returning back to 
10-20 MPa. Thus, the inward collapse of the peak-ring materials towards the 
central uplift and the subsequent outward collapse are associated with elevated 
pressures, above the ultimate overburden pressure in the peak-ring materials at 
their final location. We note that pressure waves caused by shockwave reflections 
from the numerical domain boundaries, which would not be present in reality, 
are superimposed on the pressure-time signal after about 130 s. While these com- 
plicate interpretation, the elevated pressure for the two minutes of central uplift 
formation and collapse is a robust outcome of the model that is insensitive to the 
location of the domain boundary. 


Data availability 

All data generated or analysed during this study are included in this published 
Article. Other Expedition 364 data are available online (https://doi.org/10.14379/ 
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Extended Data Fig. 1 | Lower-hemisphere, equal-area diagrams showing poles to pre-impact aplite, diabase and pegmatite sheet intrusions. 
N, north. n, number of dykes. 
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Extended Data Fig. 2 | Diagram showing pressure versus time as solid line shows average pressure (all tracer particles). We note the elevated 
recorded by 100 Lagrangian tracer particles in the peak-ring rocks. pressures between T= 100s and T= 250 s during central uplift formation 
(See Supplementary Video for location of tracer particles). Grey circles and collapse. 
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Options for keeping the food system 
within environmental limits 
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The food system is a major driver of climate change, changes in land use, depletion of freshwater resources, and pollution 
of aquatic and terrestrial ecosystems through excessive nitrogen and phosphorus inputs. Here we show that between 
2010 and 2050, as a result of expected changes in population and income levels, the environmental effects of the food 
system could increase by 50-90% in the absence of technological changes and dedicated mitigation measures, reaching 
levels that are beyond the planetary boundaries that define a safe operating space for humanity. We analyse several 
options for reducing the environmental effects of the food system, including dietary changes towards healthier, more 
plant-based diets, improvements in technologies and management, and reductions in food loss and waste. We find that 
no single measure is enough to keep these effects within all planetary boundaries simultaneously, and that a synergistic 
combination of measures will be needed to sufficiently mitigate the projected increase in environmental pressures. 


The global food system is a major driver of climate change’”, land-use 
change and biodiversity loss**4, depletion of freshwater resources”®, and 
pollution of aquatic and terrestrial ecosystems through nitrogen and 
phosphorus run-off from fertilizer and manure application’”~’. It has 
contributed to the crossing of several of the proposed ‘planetary bound- 
aries’ that attempt to define a safe operating space for humanity on a 
stable Earth system'”-?, in particular those concerning climate change, 
biosphere integrity, and biogeochemical flows related to nitrogen and 
phosphorous cycles. If socioeconomic changes towards Western con- 
sumption patterns continue, the environmental pressures of the food 
system are likely to intensify'*-!°, and humanity might soon approach 
the planetary boundaries for global freshwater use, change in land use, 
and ocean acidification'!!*!”. Beyond those boundaries, ecosystems 
could be at risk of being destabilized and losing the regulation functions 
on which populations depend!!"”. 

Here we analyse the option space available for the food system to 
reduce its environmental impacts and stay within the planetary bound- 
aries related to food production. We build on existing analyses that 
have advanced the planetary-boundary framework in terms of systemic 
threats to large-scale ecosystems! !!*!8~?° discussed the role of agricul- 
ture with respect to those pressures!®”!, and analysed the impacts on 
individual environmental domains”*”’, including selected measures 
to alleviate those impacts””-**. The planetary-boundary framework 
is not without criticism, particularly because of the heterogeneity of 
the different boundaries and their underlying scientific bases, includ- 
ing the difficulty of defining global ecosystem thresholds for local 


environmental impacts”*-?’. Despite these limitations, we consider 
the planetary-boundary framework to be useful for framing, in broad 
terms, the planetary option space that preserves the sustainability of 
key ecosystems. We acknowledge the ongoing debate by quantifying the 
planetary boundaries of the food system in terms of broad ranges that 
reflect methodological uncertainties (see Methods), and by reporting 
the environmental impacts in absolute terms (for example, emissions 
in tonnes of carbon dioxide equivalents), which allows for comparisons 
to other measures of environmental sustainability. 

We advance the present state of knowledge by constructing and 
calibrating a global food-systems model with country-level detail that 
resolves the major food-related environmental impacts and includes 
a comprehensive treatment of measures for reducing these impacts 
(see Methods). The regional detail of the model accounts for different 
production methods and environmental impacts that are linked by 
imports and exports of primary, intermediate and final products. We 
use the food-system model and estimates of present and future food 
demand to quantify food-related environmental impacts at the country 
and crop level in 2010 and 2050 for five environmental domains and the 
related planetary boundaries: greenhouse-gas (GHG) emission related 
to climate change; cropland use related to land-system change; fresh- 
water use of surface and groundwater; and nitrogen and phosphorus 
application related to biogeochemical flows. 

To characterize pathways towards a food system with lower envi- 
ronmental impacts that stays within planetary boundaries, we connect 
a region-specific analysis of the food system to a detailed analysis of 
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Fig. 1 | Present (2010) and projected (2050) environmental pressures 
on five environmental domains divided by food group. Environmental 
pressures are allocated to the final food product, accounting for the use 
and impacts of primary products in the production of vegetable oils and 
refined sugar, and for feed requirements in animal products. Impacts are 
shown as percentages of present impacts, given a baseline projection to 
2050 without dedicated mitigation measures for a middle-of-the-road 
socioeconomic development pathway (SSP2). Absolute impacts for all 
socioeconomic pathways are provided in the main text and the data 
referred to in the ‘Data availability’ statement (see Methods). 


measures of change, including reductions in food loss and waste, tech- 
nological and management-related improvements, and dietary changes 
towards healthier, more plant-based diets (Extended Data Table 1). The 
scenarios regarding food loss and waste align with and exceed commit- 
ments made as part of the United Nations’ Sustainable Development 
Goals*8*°. The scenarios concerning technological change account for 
future improvements in agricultural yields and fertilizer application, 
increases in feed efficiency, and changes in management practices*!*. 
Finally, the scenarios around dietary change include changes towards 
dietary guidelines and more plant-based dietary patterns that are in 
line with present evidence on healthy eating**-*’. 

In our baseline trajectory, we account for different socioeconomic 
pathways of population and income growth®’, and project future 
demand for environmental resources in the absence of technological 
changes and dedicated mitigation measures. Although some of the 
measures of change considered here can be expected to be implemented 
by 2050, their level of ambition is uncertain and implementation will 
not happen automatically. We therefore analyse each measure of change 
explicitly and differentiate between two degrees of implementation: 
medium and high ambition. Measures of medium ambition are in line 
with stated intentions (for example, reducing food loss and waste by 
half), and measures of high ambition go beyond expectations but can 
be considered attainable with large-scale adoption of existing best prac- 
tices (for example, reducing food loss and waste by 75%). 


Environmental impacts of the food system 

Our analysis indicates that current and projected levels of agricul- 
tural production, in the absence of targeted mitigation measures, will 
greatly affect the Earth’s environment. We estimate that, in 2010, the 
food system emitted roughly the equivalent of 5.2 billion tonnes of 
carbon dioxide in GHG emissions in the form of methane and nitrous 
oxide; the food system also occupied 12.6 million kim’ of cropland, used 
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1,810 km? of freshwater resources from surface and groundwater 
(bluewater), and applied 104 teragrams of nitrogen (TgN) and 
18 teragrams of phosphorus (TgP) in the form of fertilizers (see 
Methods, ‘Data availability’). Our estimates are comparable to 
previous estimates of food-related GHG emissions'*® of 4.6—-5.8 
billion tonnes of carbon dioxide equivalents, global cropland use*® 
of 12.2-17.1 million km? in 2000, bluewater use>?° in 2000 of 
1,700-2,270 km, and nitrogen*® and phosphorus**"! application 
in 2010 of 104 TgN and 15.8-18.8 TgP. 

Food production and consumption are projected to change between 
2010 and 2050 (Extended Data Table 2) as a result of expected socioec- 
onomic developments (Supplementary Table 1). Those developments 
include the growth of the global population by about a third (with a 
range of 23-45%, from 6.9 billion in 2010 to 8.5-10 billion in 2050) and 
a tripling of global income (with a range of 2.6-4.2, from US$68 trillion 
in 2010 to US$180-290 trillion in 2050)°*. Because of these changes, 
we predict the environmental pressures of the food system to increase 
by 50-92% for each indicator in the absence of technological change 
and other mitigation measures (Fig. 1). The greatest increases along 
this baseline pathway are projected for GHG emissions (87%, range 
80-92%), then for the demand for cropland use (67%, range 66-68%), 
bluewater use (65%, range 64-65%), phosphorus application (54%, 
range 51-55%) and nitrogen application (51%, range 50-52%). 

Specific food groups vary in their environmental impacts (Fig. 1). 
The production of animal products generates the majority of food- 
related GHG emissions (72-78% of total agricultural emissions), which is 
due to low feed-conversion efficiencies, enteric fermentation in ruminants, 
and manure-related emissions*’; the feed-related impacts of animal 
products also contribute to bluewater use (around 10%) and pressures 
on cropland, as well as nitrogen and phosphorus application (20-25% 
each). By comparison, staple crops have generally lower environmental 
footprints (impacts per kg of product) than animal products (Extended 
Data Table 3), in particular for GHG emissions, but they can have high 
total impacts because of their higher production volumes (Extended 
Data Table 2). According to our estimates, staple crops grown for human 
consumption are responsible for a third to a half (30-50%) of cropland 
use, bluewater use, and nitrogen and phosphorus application. The pro- 
jected population growth between 2010 and 2050 contributes to a general 
increase in the impacts of each food group, and the projected income 
growth changes the relative contribution of each, with a shift towards 
a larger proportion of impacts from animal products (7-16% increase 
across environmental domains) and fruits and vegetables (2-28% 
increase), and a smaller proportion from staple crops (7-19% reduction). 


Changes in food management, technology and diets 
Reducing food loss and waste is one measure for reducing food demand 
and the associated environmental impacts. At present it is estimated 
that more than a third of all food that is produced is lost before it 
reaches the market, or is wasted by households”®. For our analysis, we 
evaluated the impacts of reducing food loss and waste to one half—a 
value in line with pledges made as part of the Sustainable Development 
Goals”*—and we also considered a reduction in food loss and waste by 
75%, which is probably close to the maximum theoretically avoidable 
value*”. We estimate that halving food loss and waste would reduce 
environmental pressures by 6-16% compared with the baseline pro- 
jection for 2050, and that reducing food loss and waste by 75% would 
reduce environmental pressures by 9-24% (Fig. 2). Relatively more sta- 
ple crops and fruits and vegetables are wasted than animal products”, 
which explains why the impacts of changes in food loss and waste are 
smaller for the livestock-dominated domains, such as GHG emissions, 
than for the staple-crop-dominated ones, such as cropland and blue- 
water use and nitrogen and phosphorus application. 

Technological changes increase the efficiency of production and 
reduce the environmental impact per unit of food produced. We ana- 
lysed the most commonly considered technological advances and 
changes in management practices with respect to their environmen- 
tal impacts (Extended Data Table 1). The measures include: increases 
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Fig. 2 | Impacts of reductions in food loss and waste, technological 
change, and dietary changes on global environmental pressures in 
2050. These projections of environmental pressures in 2050 are baseline 
projections without dedicated mitigation measures for a middle-of- 
the-road development pathway, and are expressed as percentages of 
present impacts (see Fig. 1). The different measures of change and their 
combination are depicted as reductions from the baseline projections 

for the different environmental domains (for example, the ‘diets’ bar that 
ends at 90% of present impacts of GHG emissions indicates that ambitious 
dietary changes (flexitarian) can reduce the projected increase of GHG 
emissions from 187% of present impacts to 90%, which represents a 
reduction of 52% or 97 percentage points; and dietary changes of medium 
ambition (guidelines), which in the figure end at the split line of the 

‘diets’ bar, can reduce GHG emissions from 187% of present impacts to 
133%, which represents a reduction of 29% or 54 percentage points). 


in agricultural yields, which reduce the demand for additional crop- 
land*°; rebalancing of fertilizer application between overapplying 
and underapplying regions*’, as well as increasing nitrogen-use 
efficiency**8 and phosphorus recycling’, which reduce demand for 
additional nitrogen and phosphorus inputs; improvements in water 
management that increase basin efficiency, storage capacity, and 
better utilization of rainwater*?; and agricultural mitigation options, 
including changes in irrigation, cropping and fertilization that reduce 
methane and nitrous oxide emissions from rice and other crops, and 
changes in manure management, feed conversion and feed additives 
that reduce enteric fermentation in livestock*!. We estimate that imple- 
menting these measures could reduce the environmental pressures of 
the food system by 3-30% compared with the 2050 baseline projec- 
tion in medium-ambition scenarios, and by 11-54% in high-ambition 
scenarios (Fig. 2). In each case, the higher-end estimates are for the 
staple-crop-dominated environmental indicators (cropland and 
bluewater use, and nitrogen and phosphorus application), for which 
general improvements in water management, agricultural yields, 
phosphorus-recycling rates and nitrogen-use efficiencies are particularly 
effective. The lower-end estimates are for GHG emissions, for which the 
contribution from livestock-related emissions is, to a large extent, an 
inherent characteristic of the animals and therefore cannot be reduced 
more substantially through existing mitigation options?!“ (Extended 
Data Table 4). 

Dietary changes towards healthier diets can reduce the environ- 
mental impacts of the food system when environmentally intensive 
foods, in particular animal products, are replaced by less intensive food 
types'>'®. For our analysis, we analysed dietary changes towards diets 
in line with global dietary guidelines for the consumption of red meat, 
sugar, fruits and vegetables, and total energy intake*>*®; as well as to 
more plant-based (flexitarian) diets that more comprehensively reflect 
the current evidence on healthy eating?” by including lower amounts 
of red and other meats and greater amounts of fruits, vegetables, nuts 
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The loss and waste scenarios include reducing food loss and waste by 

half (waste/2) and by 75% (waste/4). The technology scenarios include 
medium-ambition technological changes up to 2050 (tech) and more 
ambitious technological changes (tech+). The diet scenarios include diets 
aligned with global dietary guidelines (guidelines), and more plant-based 
flexitarian diets (flexitarian) that are reflective of present evidence on 
healthy eating. The scenario combinations include all measures of medium 
ambition (comb(med): waste/2, tech, guidelines) and all measures of high 
ambition (comb(high): waste/4, tech+, flexitarian), the latter including 
an optimistic socioeconomic development pathway with higher income 
and lower population growth. The diamonds indicate mean planetary- 
boundary values (boundary), each associated with uncertainty intervals 
highlighted by colour (light green, below the mean value; light orange, 
between minimum and maximum values; light red, above maximum 
values). 


and legumes (Extended Data Tables 1 and 5). We estimate that, com- 
pared with the baseline projection for 2050, dietary changes towards 
healthier diets could reduce GHG emissions and other environmental 
impacts by 29% and 5-9%, respectively, for the dietary-guidelines sce- 
nario, and by 56% and 6-22%, respectively, for the more plant-based 
diet scenario (Fig. 2). The changes are in line with the dietary compo- 
sition of the diets and the environmental footprints of each food group 
(Fig. 1, Extended Data Table 1 and Supplementary Table 2). Changes in 
meat consumption dominate the impacts on GHG emissions, while for 
the other domains the environmental pressures associated with greater 
consumption of fruits, vegetables, nuts and legumes are more important 
but outweighed by the environmental benefits associated with lower 
consumption of meat, staple crops and sugar, and a generally lower 
energy intake in line with healthy body weights and recommended 
levels of physical activity*® (Extended Data Table 6). 

To understand how the combined implementation of some or all of 
the discussed measures could influence the environmental pressures 
of the food system, we constructed an environmental option space by 
combining all measures of medium ambition and all measures of high 
ambition. Our analysis indicates that much of the increase in environ- 
mental pressures that is expected to occur by 2050 could be mitigated if 
measures were combined (Fig. 2). Combining all measures of medium 
ambition could reduce environmental pressures by around 25-45% 
compared with the baseline projection for 2050, resulting in total 
environmental impacts that are within 15% above and below present 
impacts. Combining all measures of high ambition could deliver reduc- 
tions of 30-60%, resulting in environmental impacts that are 20-55% 
less than the current ones. In line with the differentiated impacts of the 
different measures of change, dietary change contributes the most to 
the reductions in GHG emissions, and technological and management- 
related changes contribute the most to reductions in the other environ- 
mental impacts, while reductions in food loss and waste contribute up 
to a third to the overall reductions (Extended Data Fig. 1). 
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Fig. 3 | Planetary option space. The figure shows combinations of 

dietary change, technological change (tech or tech+), changes in food 

loss and waste (waste/2 or waste/4), and socioeconomic development 
pathways (SSP1, SSP2 or SSP3). These changes are applied to baseline 
conditions in 2050 (baseline). The diet scenarios include diets aligned with 
global dietary guidelines (guidelines), and more plant-based flexitarian 
diets (flexitarian) that are reflective of the current evidence on healthy 
eating. The loss and waste scenarios include reducing food loss and 

waste by half (waste/2) and by 75% (waste/4). The technology scenarios 
include medium-ambition technological changes up to 2050 (tech) and 


Planetary option space 

What level of reduction in environmental pressures should be aimed 
for? We can explore this question through comparison to the associ- 
ated planetary boundaries that are intended to describe a safe operating 
space for humanity. For our analysis, we adapted or newly quantified 
the food-related planetary-boundary values, including upper and lower 
limits (Extended Data Table 7, Extended Data Fig. 2 and Methods). 
According to our quantification, the planetary boundaries define a space 
around the present values for most environmental domains, with a mean 
value slightly below present values for food-related GHG emissions, at 
current values for cropland use, slightly above present values for bluewa- 
ter use, and substantially below present values for nitrogen and phospho- 
rus application (Fig. 2). Following the baseline trajectory of population 
and income change, and the related changes in food consumption and 
production, would lead to all mean values of the planetary bounda- 
ries being crossed. The environmental impacts of the food system 
would exceed the planetary boundaries for food-related GHG emissions 
by 110%, for cropland use by 70%, for bluewater use by 50%, for nitrogen 
application by 125%, and for phosphorus application by 75%. 

Our analysis indicates that staying within planetary boundaries is 
possible with a combination of measures of high ambition for GHG 
emissions and nitrogen and phosphorus application, and with a com- 
bination of measures of medium ambition for cropland and bluewater 
use (Fig. 2). An analysis of the planetary option space details the 
possible combination of measures (Fig. 3). It shows that staying within 
the mean value of the GHG boundary requires ambitious dietary 
change towards more plant-based, flexitarian diets, in combination with 
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more ambitious technological changes (tech+). The socioeconomic 
development pathways include a middle-of-the-road development 
pathway (SSP2), a more optimistic one with higher income and lower 
population growth (SSP1), and a more pessimistic one with lower income 
and higher population growth (SSP3). Colours and numbers indicate 
combinations that are below the lower bound of the planetary-boundary 
range (dark green, 1), below the mean value but above the minimum value 
(light green, 2), above the mean value but below the maximum (orange, 3), 
and above the maximum value (red, 4). 


either reductions in food loss and waste or technological improvements; 
staying within the mean values of the cropland and bluewater bounda- 
ries requires technological improvements in combination with reduc- 
tions in food loss and waste; and staying within the mean values of the 
nitrogen and phosphorus boundaries requires ambitious technological 
improvements combined (for the nitrogen boundary) with dietary 
changes towards more plant-based diets, reductions in food loss and 
waste, and, in some combinations, a more optimistic socioeconomic 
development pathway that includes lower population and higher 
income growth than is expected at present. Combining those measures 
synergistically results in adoption of different measures of technological 
change for each environmental domain, coupled in each case to dietary 
changes towards more plant-based diets, reductions in food loss and 
waste, and an optimistic socioeconomic development pathway (Fig. 4). 


Uncertainties 

Our estimates are subject to several uncertainties. Some of the planetary- 
boundary values have a large uncertainty range, which reflects the 
difficulties of scaling up local environmental pressures to global levels!?”°, 
in particular regarding bluewater use and nitrogen and phosphorus 
application (see Methods). The planetary-boundary framework can 
therefore provide only a very broad measure of the sustainability of 
the food system. Our analysis indicates that using the upper bound 
of the planetary-boundary range increases the option space (Fig. 3) 
and, for example, does not require reductions in food loss and waste 
or a more optimistic socioeconomic development pathway; however, 
meeting the lower bound of the planetary-boundary range would 
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Fig. 4 | Combination and relative contributions of mitigation measures 
that simultaneously reduce environmental impacts below the mean 
values of the planetary-boundary range. The mitigation measures include 
different levels of technological improvements for each environmental 
domain (measures of high ambition (tech+) for nitrogen and phosphorus 
application, and measures of medium ambition (tech) for GHG emissions 
and for cropland and bluewater use). The other measures are not 
differentiated by environmental domain, and include a halving of food loss 
and waste (waste/2), changes towards more plant-based flexitarian diets 
(FLX), and optimistic socioeconomic development with higher income and 
lower population growth (SSP1) than expected at present. A middle-of- 
the-road development pathway is also feasible when combined with more 
ambitious reductions in food loss and waste (see Fig. 3). 


not be possible for bluewater use and nitrogen application with the 
mitigation options considered here. Using different control variables to 
measure the state of planetary boundaries could also affect the option 
space. However, assessing the impacts of nitrogen pollution by using a 
measure of nitrogen surplus that accounts for all inputs and offtakes of 
nitrogen had little influence on the option space (Extended Data Fig. 3). 

Other uncertainties are related to the set-up of our modelling frame- 
work. Although we did consider some feedback effects between the 
different measures of change—particularly between changes in yields 
and the demand for bluewater, nitrogen and phosphorus use—this 
was limited to the scenarios of medium ambition (see Methods). This 
method allowed for the differentiated adoption of ambitious technolog- 
ical change for domains other than cropland use without also requiring 
such levels for the latter. In a sensitivity analysis, we assessed the feed- 
back effects that very high yield increases could have on nitrogen and 
phosphorus application*’, and found that the demand for nitrogen and 
phosphorus could increase across the different scenario combinations 
with large yield-gap closures by 8-14% and 25-32%, respectively, which 
would moderately reduce the planetary option space for those scenarios 
(Extended Data Fig. 3). In line with our focus on mitigation measures, 
we did not assess the impacts that climate change could have on crop 
yields and freshwater availability*®. While economic responses might be 
able to mitigate some proportion of the biophysical impacts of climate 
change”’, such responses could reduce the availability and effectiveness 
of additional mitigation and adaptation measures, and thereby reduce 
the planetary option space. 

Additional research would reduce the uncertainty of our scenario 
analysis. In our scenarios of change, we chose to focus on changes— 
technological, dietary, and in food loss and waste—that are considered 
realistic or attainable, or have been set as goals. This means that we did 
not include technologies or mitigation measures that have large uncer- 
tainties at present, such as soil carbon sequestration, nitrogen-fixing 
cereals, or landless biomass production. Some of those measures have 
shown some prospect in certain regions, but it is not yet clear whether 
they are scalable and what their relationship to existing technologies 
and environmental targets would be*®. For example, land-based carbon 
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sequestration, while reducing GHG emissions, could put additional 
pressures on croplands or pastures, with implications for land-use 
and biodiversity targets. Other areas for further research include the 
quantification of co-benefits of food-system change, for example, on 
health’>“?, biodiversity°®, and the economy”, as well as context-specific 
metrics of sustainability and a greater focus on livelihood, for example 
in terms of food security*!. 


Policy implications 

Our analysis suggests that staying within the planetary boundaries of 
the food system requires a combination of measures: GHG emissions 
cannot be sufficiently mitigated without dietary changes towards more 
plant-based diets; cropland and bluewater use are best addressed by 
improvements in technologies and management that close yield gaps and 
increase water-use efficiency; and reducing nitrogen and phosphorus 
application will require a combination of measures to stay below the 
mean values of the planetary boundaries, including dietary change, 
reductions in food loss and waste, improvements in technologies and 
management that increase use efficiencies for nitrogen and recycling 
rates for phosphorus, and efforts in global socioeconomic development. 

Implementation of these measures will depend on the regulatory and 
incentive framework in each region. In particular, practical options 
exist for improving technologies and management practices (Extended 
Data Table 1), but adoption of those options will require investment in 
public infrastructure, the right incentive schemes for farmers (including 
support mechanisms to adopt best available practices), and better 
regulation (for example, of water use and quality). Concrete options 
also exist for improving socioeconomic development in developing 
countries, including investments in education, particularly for women, 
and improving access to general and reproductive health services”. 
Meaningfully reducing food loss and waste will require measures 
across the entire food-supply chain*°, with possible emphasis on 
investments in agricultural infrastructure, technological skills, stor- 
age, transport, and distribution in developing regions; and education 
and awareness campaigns, food labelling, improved packaging that 
prolongs shelf life, and changes in legislation and business behaviour 
that promote closed-loop supply chains (in which waste is recycled 
back into the system) in developed areas. For dietary change, the avail- 
able evidence suggests that providing information without additional 
economic or environmental changes has a limited influence on behav- 
iour, and that integrated, multicomponent approaches that include 
clear policy measures might be best suited for changing diets*?*. 
Those can include a combination of media and education campaigns; 
labelling and consumer information; fiscal measures, such as taxa- 
tion, subsidies, and other economic incentives; school and workplace 
approaches; local environmental changes; and direct restriction and 
mandates™. An important first step would be to align national food- 
based dietary guidelines with the present evidence on healthy eating 
and the environmental impacts of diets*>”°. 

Our analysis suggests that the environmental impacts of the food 
system could increase markedly owing to expected changes in food 
consumption and production, and, in the absence of targeted measures, 
would exceed planetary boundaries to the extent that key ecosystem 
processes could become at risk of being destabilized. Synergistically 
combining improvements in technologies and management, reduc- 
tions in food loss and waste, and dietary changes towards healthier, 
more plant-based diets, with particular attention to local contexts and 
environmental pressures, will be a key challenge in defining region- 
specific pathways for the sustainable development of food systems 
within the planetary option space. We hope that the country-specific 
data and suite of scenarios produced for this study (see Methods, ‘Data 
availability’) can provide a good starting point for this endeavour. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
https://doi.org/10.1038/s41586-018-0594-0. 
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METHODS 


Food-system model. For our analysis, we constructed a food-systems model 
that connects food consumption and production across regions (Supplementary 
Information). We distinguished several steps along the food chain: primary 
production (including non-food uses, for example, in industry, seed banks, and 
as biofuels); trade in primary commodities; processing to oils, oil cakes and 
refined sugar; use of feed for animals; and trade in processed commodities and 
animals (Extended Data Table 2). We parameterized the model with data from the 
International Model for Policy Analysis of Agricultural Commodities and Trade 
(IMPACT)*? on current and future food production, processing factors, and feed 
requirements for 62 agricultural commodities and 159 countries. Projections of 
future food consumption and production were based on statistical association 
with changes in income and population, and were in line with other projections””. 

To assess the environmental impacts of the food system, we paired the food- 
system model with a set of country-specific environmental footprints related to 
GHG emissions, cropland use, bluewater use, and nitrogen and phosphorus appli- 
cation (Extended Data Table 3; data available upon request). In line with projections 
of the allowable agricultural emissions budget”, and our separate treatment of land 
use, we focused on the non-CO) emissions of agriculture, in particular methane 
and nitrous oxide. Data on GHG emissions were adopted from country-specific 
analyses of GHG emissions from crops”? and livestock**. Non-CO} emissions of 
fish and seafood were calculated on the basis of feed requirements and feed-related 
emissions of aquaculture”, and on projections of the ratio between wild-caught 
and farmed fish production®»®. Our baseline emissions estimate agrees well with 
existing ones that follow the same methodology’. 

Data on cropland and consumptive bluewater use were adopted from the 
IMPACT model™. To derive commodity-specific footprints, we divided use data 
by data on primary production, and we calculated the footprints of processed 
goods (vegetable oils, refined sugar) by using country-specific conversion ratios®?, 
and splitting co-products (oils and oil meals) by economic value to avoid double 
counting. We used country-specific feed requirements for terrestrial animals” 
to derive the cropland and bluewater footprints for meat and dairy, and we used 
global feed requirements for aquaculture” and projections of the ratio between 
wild-caught and farmed fish production®® to derive the cropland and bluewater 
footprints for fish and seafood. 

Data on fertilizer application rates of nitrogen and phosphorous were adopted 

from the International Fertilizer Industry Association”. In line with the planetary 
boundaries, we focus on application rates as the control variables in our main 
analysis. However, we note that regional environmental impacts often depend on 
the surplus of reactive nitrogen, a measure that accounts for all inputs and offtakes 
of nitrogen™. For a sensitivity analysis, we therefore constructed a region-specific 
nitrogen-budget module and linked it to the food-system model. Therein, we 
define the nitrogen surplus as the sum of fertilizer use, fixation by crops, manure 
application, human excreta and atmospheric deposition, minus nitrogen offtake 
by crops?”#35 (Supplementary Information). The results of the sensitivity analysis 
are reported in Extended Data Fig. 3. 
Scenario analysis. We used the food-system model to estimate the environmental 
impacts of the food system in 2050 on GHG emissions, cropland use, bluewater use, 
and nitrogen and phosphorus application. To estimate the environmental impacts 
in the absence of dedicated mitigation measures (a scenario we term ‘baseline 
projectiom), we paired the footprints of current intensity to future projections of 
food demand along several socioeconomic pathways that were developed by the 
climate-change research community (Supplementary Table 1), including a middle- 
of-the-road development pathway (SSP2), a more optimistic pathway with higher 
income and lower population growth (SSP1), and a more pessimistic pathway with 
lower income and greater population growth (SSP3)** ®*. Underlying the pathways 
are data and projections of the age, sex and educational structure of populations, 
as well as age-specific fertility, mortality and migration®”. 

We then analysed the option space for reducing the environmental pressures 
of the food system by constructing scenarios of changes in food loss and waste, 
technological change and dietary change (Extended Data Table 1). For each meas- 
ure, we differentiated between changes of medium and high ambition. Estimates 
of food loss and waste were based on percentage values reported by the UN Food 
and Agriculture Organization (FAO)**. In the standard scenario (waste/2), we 
assumed that food losses at the production side and food waste at the consumption 
side are reduced by half—a goal in line with the UN Sustainable Development 
Goals for 2030. In the ambitious scenarios (waste/4), we assumed reductions in 
food loss and waste of 75%, which is probably close to the maximum value that 
can be theoretically avoided*”. 

The scenarios of technological change (tech and tech+) include projected 
efficiency gains in emissions intensities, agricultural yields, feed conversion, 
water use, and nitrogen and phosphorus application (Extended Data Table 4). For 
the scenarios describing changes in emissions intensities of foods, we incorpo- 
rated the mitigation potential of bottom-up changes in management practices and 


technologies by using marginal abatement cost curves"! and the value of the social 
cost of carbon (SCC) in 2050. The mitigation options included changes in irriga- 
tion, cropping and fertilization that reduce methane and nitrous oxide emissions 
for rice and other crops, as well as changes in manure management, feed conversion 
and feed additives that reduce enteric fermentation in livestock. We used SCC 
values of 72 US dollars per metric ton of CO2 equivalents (US$/tCO, equivalents) 
associated with a rate of discounting future climate damages by 3% for the 
scenario of medium ambition (tech), and implemented all available mitigation 
options (equivalent to using a SCC of above 99 US$/tCO? equivalents) for the 
scenario of high ambition (tech+). No marginal abatement curves were available 
for some crops, such as fruits, vegetables, nuts, sugar crops and oilseeds. Adopting 
the average mitigation potential for staple crops for these crops would increase the 
total mitigation potential by 1%. 

Efficiency gains in agricultural yields, water management and feed conversion 
were based on IMPACT projections*’. For water management, we relied on an 
integrated hydrological model within IMPACT that operates at the level of water- 
sheds and accounts for management changes that increase basin efficiency, storage 
capacity and better utilization of rainwater**. For most crops, improvements in 
water management exceed increased water demand associated with yield improve- 
ments, except for soybeans. For agricultural yields, the gains in land-use efficiency 
matched estimates of yield-gap closures of about 75% between present yields 
and yields that are feasible in a given agricultural-climatic zone*”. The potential 
efficiency gains in nitrogen and phosphorus application rates included rebalancing 
of fertilizer application rates between overapplying and underapplying regions in 
line with closing yield gaps”. In the ambitious technology scenario (tech+), we 
increased yield-gap closures to 90% on the basis of data from a previous study*’, 
and assumed additional improvements in nitrogen-use efficiency of 30% (in line 
with targets suggested by the Global Nitrogen Assessment™) and a recycling rate 
of phosphorus’ of 50%. No further changes in efficiency were assumed for water 
use in the tech+ scenario. For most crops, land-use efficiencies increase in the 
ambitious technology scenario, except in the case of soybeans, which are assessed 
on a more conservative basis in a previous study** than by the IMPACT team. 

The scenarios of dietary change include shifts towards diets that are in line 
with global dietary guidelines (guidelines), and towards dietary patterns that 
are more specialized but nutritionally balanced (flexitarian). For the former, we 
followed suggestions to limit the intake of red meat to less than 300 g per week” 
and the intake of added sugar to less than 5% of total energy intake (about 31 g 
per day)’!, to consume five portions (400 grams per day) or more of fruits and 
vegetables*, and to balance energy intake (and physical activity levels) to maintain 
a healthy body weight*>. Estimates of energy intake were based on the calorie needs 
of a moderately active population of US characteristics for height, divided into 
five-year age groups’”—something that can be seen as an upper bound. Calorie 
needs reach a maximum of 2,500 kcal per day for ages 19-25 (averaged between 
men and women), but are reduced to 2,000 kcal per day for ages 66 and older. 
The average calorie needs differed by region according to its age composition, 
and averaged around 2,100 kcal per day. In a sensitivity analysis, we implemented 
changes in dietary composition only, without restricting energy intake. Baseline 
intakes of food and energy were calculated from food-availability projections of 
the IMPACT model by using region-specific factors of food waste and ratios of 
the edible portions of foods”*. 

In scenarios of ambitious dietary change, we increased the stringency of the 
global recommendations and defined more plant-based (flexitarian) dietary 
patterns that reflect current evidence on healthy eating?”“%7> (Extended Data 
Table 5 and Supplementary Table 2). The flexitarian diets included: at least 500 g 
per day of fruits and vegetables of different colours and groups (the composition of 
which is determined by regional preferences); at least 100 g per day of plant-based 
protein sources (legumes, soybeans and nuts); modest amounts of animal-based 
proteins, such as poultry, fish, milk and eggs; and limited amounts of red meat 
(one portion per week), refined sugar (less than 5% of total energy), vegetable 
oils that are high in saturated fat (in particular palm oil) and starchy foods with 
a relatively high glycaemic index. We aimed to preserve the regional character of 
dietary patterns by maintaining the regional composition of specific foods within 
broader categories, such as preferences for specific staple crops (wheat, maize, rice 
and so on) and fruits (temperate or tropical). 

Planetary boundaries. The planetary-boundary framework attempts to define 
a safe operating space for humanity that is characterized by a stable Earth 
system! Above planetary boundaries, it is suggested that ecosystem processes 
are at risk of becoming destabilized'"*. To contextualize the environmental 
impacts of the food system, we critically reviewed and adapted planetary-boundary 
values for GHG emissions, cropland use, bluewater use, and nitrogen and phos- 
phorus application (Extended Data Table 7). For the climate-change boundary, 
we adopted an emissions budget for food-related (non-CO2) GHG emissions 
that is in line with having a 66% chance of limiting global warming to below 2°C 
(Representative Concentration Pathway (RCP)2.6); we derived this budget from 
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a model comparison of three integrated assessment models*’, normalized to the 
marker scenario of the associated emissions pathway®’. The resulting budget of 
4.7 GtCO> equivalents (range 4.3-5.3 GtCO equivalents). focuses on the non-CO} 
emissions related to agriculture (methane and nitrous oxide), in line with previous 
assessments”® and methodology followed by the International Panel on Climate 
Change. However, we note that agriculture and land use also act as source and sink 
for CO>, for example through deforestation and carbon sequestration in soils”. 
How those flows should be balanced vis-a-vis the emissions from other sectors, 
and how additional pressures from land-based CO, sequestration contribute or 
counteract other sustainability targets and planetary boundaries, are important 
questions for future research. 

Large uncertainties exist as to what an appropriate planetary boundary for land 
use should be!”. From an analysis of forest biomes, a boundary value’ was previously 
suggested in line with maintaining (not increasing pressure on) present forest cover. 
Such a target is in line with the strongly correlated target for biosphere integrity 
if nonagricultural land is placed under protection of biodiversity-compatible 
land use’»”>-’6, Because our modelling framework explicitly tracks cropland use, 
we translate the suggested target to a value of keeping current cropland use at 
12.6 million km? (range 10.6-14.6 million km?), given our own model calculations 
using the IMPACT model*’, In future work, it will be desirable to include the 
role of pastures, an explicit treatment of forest cover, and further differentiation 
of other forms of land cover. However, a complication with switching from land 
use to forest cover is that the latter depends not only on agriculture, but also on 
wood harvesting, urbanization, and other socioeconomic variables. More than 
two-thirds of agricultural land is used for grazing. Converting highly productive 
grazing land into cropland could therefore be a conservation strategy that would 
relax the boundary value for cropland without affecting forest cover. However, 
estimates of feasible conversion ratios are still a matter of debate”’. 

Two basin-level assessments of the environmental flow requirements of river 
systems have been used to suggest planetary boundaries for the consumption of 
bluewater’?”°. We adopt the more stringent values of the more detailed standalone 
analysis (2,800 km’; range 1,100-4,500 km?)°, which includes the other suggested 
values in its uncertainty range!*’’. Because not all bluewater is used in agriculture, 
we scale from total consumptive bluewater use (2,550 km?3)° to the consumptive 
bluewater used in agriculture (1,810 km?) as assessed with our hydrological model, 
which yields a boundary of 1,980 km? (range 780-3,190 km) of bluewater used 
in agriculture. We note that uncertainties persist about the concrete assumptions 
on environmental flow requirements!”’8, and about which methodology would 
be best suited”’. 

To inform the boundary value for reactive nitrogen, a previous study’? calcu- 
lated global risk values for eutrophication on the basis of region-specific estimates 
of current nitrogen concentration in run-off and concentrations that would stay 
below ecological and toxicological thresholds of inorganic nitrogen pollution. The 
original boundary value for nitrogen was calculated by multiplying the global risk 
value by an estimate of current anthropogenic nitrogen fixation (fertilizer use 
plus fixation by crops)!*. Here we apply the risk values to nitrogen application 
from fertilizers—in line with the focus in the planetary-boundary literature on 
anthropogenic disruptions of ecosystems! !*—and we use the nitrogen surplus 
(the sum of fertilizer use, fixation by crops, manure application, human excreta and 
atmospheric deposition, minus nitrogen offtake by crops) as a control variable in a 
sensitivity analysis (Extended Data Fig. 3). The resulting estimate of 52-69 TgN per 
year (67-90 TgN when using nitrogen surplus as a control) might be considered 
conservative, because the previous study!” maintained regions that currently apply 
less than the critical load of nitrogen at that value, which in some cases can be much 
lower than needed from an environmental and food-security perspective®®. For 
that reason, we adopted an upper boundary value in line with a scenario” that 
balanced nitrogen application between overapplying and underapplying regions 
and closed yield gaps to 75%, which yielded a final boundary value of 69 TgN 
(range 52-113 TgN) of nitrogen application from fertilizers (90 TgN (range 
67-146 TgN) of nitrogen surplus). 

Unlike nitrogen, phosphorus can build up in the soil and is washed out as 
run-off during erosion’. Existing estimates of boundary values for phosphorus!® 
have several shortcomings in that they are based on constant erosion rates and 
do not take into account critical sources of phosphorus, such as human waste/ 
excreta. In the previous study’? a global phosphorus-flow model was developed 
that focused on added phosphorus assuming steady-state surface pools, critical 
phosphorus concentrations of 50-100 mg per litre to prevent eutrophication, and 
flexible recycling rates (Extended Data Fig. 2 and Supplementary Information). 
Under no-waste recycling, the long-term phosphorus boundary amounted to 
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6-12 TgP per year, increasing to 8-16 TgP per year at a recycling rate of 50%. In 
line with our focus on scenarios of change, we adopted the latter values. As with 
nitrogen, there are great regional imbalances of phosphorus application*’, so we 
again infer an upper tolerable value from a scenario” that rebalanced phosphorus 
application between overapplying and underapplying regions and closed yield gaps 
to 75%. The resulting internally derived phosphorus boundary is 16 TgP (range 
8-17 TgP) of phosphorus application. 

Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 
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medium ambition (comb(med)) and of all measures of high ambition lower population growth) and more pessimistic (lower income and higher 
(comb(high)). The mitigation measures include changes in food loss and population growth) are indicated by the uncertainty range around the 


waste (loss&waste), technological change (technology) and dietary change —_ markers (socio-econ). 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


P human 


1-frP 2. 


Pane 


frP 


ret,sed 


| Peiment 


ret,soil 


water 


1-frP 


ret,sed 
Extended Data Fig. 2 | Overview of major flows of phosphorus at the that is not recycled to land (1 — frPyec), which becomes a point source 
global scale. The external acceptable phosphorus (P) input is determined for water pollution. This phosphorus can only be stored in sediment 
by the acceptable long-term accumulation of phosphorus in the soil at a given phosphorus-retention fraction (frPyet,seq), While the recycled 
(P soil) and sediment (P sediment) at a phosphorus concentration in phosphorus can additionally be stored in soil (at a retention fraction 
surface waters (P surface water) that equals a critical threshold. The frPret,soil)- The critical phosphorus input (Pincrit)) can be calculated as 
phosphorus boundary is affected by the fraction of phosphorus that the sum of critical phosphorus retention in the soil and sediment, and a 
is taken up by humans (P human; frP uptake being the P-use efficiency, critical input to surface water (oceans) that is due to run-off and leaching. 
PUE, of the complete food chain, from mined phosphorus (P mine) to The Supplementary Information contains a full derivation of phosphorus 
P intake) and the fraction of phosphorus excreted by humans (P waste) flows and quantitative estimates of critical phosphorus inputs. 
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Extended Data Fig. 3 | Planetary option space related to different 
control variables of nitrogen and yield-related feedback effects. The 
control variables include nitrogen inputs related to synthetic fertilizers 

as used in the main analysis, and the more comprehensive measure of 
nitrogen surplus that accounts for all inputs and offtakes of nitrogen. The 
types of feedback effects include changes in nitrogen and phosphorus 
application associated with closing yield gaps by 75%, as modelled in the 


tech scenario for cropland use (main), and changes associated with closing 
yield gaps by 90%, as modelled in the tech+ scenario for cropland use 
(high yields). Colours and numbers indicate combinations that are below 
the lower bound of the planetary-boundary range (dark green, 1), below 
the mean value but above the minimum value (light green, 2), above the 
mean value but below the maximum (orange, 3), and above the maximum 
value (red, 4). 
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Extended Data Table 1 | Scenarios of reductions in food loss and waste, technological change and dietary change 


Scenario Assumptions 


Food losses and waste are reduced by half, in line with pledges made as part of the Sustainable 


Waste/2 
Development Goals ”°. 


Food losses and waste are reduced by three quarters, a value likely close to the maximum value that can 


Waste/4 
be theoretically avoided °°. 


Closing of yield gaps between attained and attainable yields to about 75% **”*; Rebalancing nitrogen and 
phosphorus fertilizer application between over and under-applying regions a improving water 
management, including increasing basin efficiency, storage capacity, and better utilization of rainwater °°; 
Tech and implementation of agricultural mitigation options that are economic at the projected social cost of 
carbon in 2050, including changes in irrigation, cropping and fertilization that reduce methane and nitrous 
oxide emissions for rice and other crops, as well as changes in manure management, feed conversion 


and feed additives that reduce enteric fermentation in livestock °". 


Additional measures on top of TECH scenario, including additional increases in agricultural yields that 
close yield gaps to 90% **: a 30% increase in nitrogen use efficiency in line with suggested targets **, 
and 50% recycling rates of phosphorus ’; phase-out of first-generation biofuels **; and implementation of 
all available bottom-up options for mitigating food-related GHG emissions *". 


Tech+ 


Dietary shifts towards global dietary guidelines, including maximum intakes for red meat (three 100g 
Guidelines servings per week) and sugar (5% of energy intake), minimum intakes of fruits and vegetables (five 
(HGD) servings a day), and energy intakes in line with recommendations on healthy body weight and physical 


activity (2100-2200 kcal per day on average) 2°67", 


Dietary shifts towards more plant-based, flexitarian dietary patterns based on recent evidence on healthy 
Flexitarian eating °”4°.? that, in addition to the HGD requirements, include more stringent limits for red meat (one 
(FLX) serving a week), limits for white meat (half a portion a day) and dairy (one portion a day), and greater 
minimum amounts of legumes, nuts, and vegetables. 


HGD, guidelines; FLX, flexitarian. Data were obtained from previous studies”:29-37:45,70,71,73, 
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Extended Data Table 2 | Global food production in 2010 and 2050 differentiated by food group and step along the food chain 


Food item ee 2 ee ee 
prod trade intr feed othr loss waste cons prod trade int’ feed othr loss waste cons 
wheat 639 114 0 109 47 34. 153 9295 892 256 0 138 75 51 206 423 
rice 430 28 0 24 16 40 33 317 538 59 0 48 19 49 33 390 
maize 797 =102 0 464 96 123 32 82 1,361 304 0 878 146 156 49 133 
other grains 315 45 O 172 38 30 21 55 514 146 0 268 49 55 37-106 
roots 767 50 0 164 56 111 109 327 1,145 130 0 167 100 232 153 494 
legumes 62 10 0 14 3 2 1 42 113 28 0 22 5 4 1 80 
soybeans 225 62 189 10 8 5 0 12 357 98 282 17 22 15 1 21 
nuts & seeds 37 14 8 4 4 5 0 16 54 21 9 5 8 11 1 21 
vegetables 996 67 0 0 O 124 296 575 1,826 351 0 0 0 206 522 1,098 
oilcrops 149 12 145 1 1 2 0 0 227 14 218 1 3 5 0 0 
palmcrop 224 0 224 0 0 0 0 0 614 O 614 0 0 0 0 0 
oilmeals 83 14 0 83 0 0 0 0 132 31 0 132 0 0 0 0 
soybmeal 155 48 O 155 0 0 0 0 232 85 0 232 0 0 0 0 
sugarcrops 1,758 0 1,758 0 0 0 0 0 3,396 0 3,396 0 0 0 0 0 
fruits (temperate) 206 25 0 0 0 63 50 92 325 57 0 0 0 97 78 150 
fruits (tropical) 260 37 0 0 0 26 77 ~— 156 462 94 0 0 0 54 130 277 
fruits (starchy) 127 21 0 0 0 27 30 70 318 57 0 0 0 82 69 167 
sugar 160 44 0 0 5 12 14 129 304 107 0 0 20 24 21 = 239 
palm oil 45 34 0 0 1 28 0 16 124 94 0 0 2 77 1 43 
vegetable oil 83 19 0 0 4 24 2 54 122 36 0 1 9 34 2 75 
beef 69 9 0 0 0 1 6 63 121 20 0 0 0 1 9 111 
lamb 15 2 0 0 0 0 1 13 33 5 0 0 0 0 2 30 
pork 105 9 0 0 0 1 9 95 131 31 0 0 0 1 11. 120 
poultry 85 8 0 0 0 1 7 77 173 30 0 0 0 2 12 158 
eggs 66 2 0 0) 5 3 4 54 96 10 0 0 8 5 5 78 
milk 627 55 0 0 0 35 35 557 1,000 156 0 0 0 67 48 885 
shellfish 33 6 0 0 0 0 18 15 49 11 0 0 0 0 27 22 
fish (freshwater) A 4 0 0 0 0 22 19 81 17 0 0 0 0 43 38 
fish (pelagic) 17 4 0 0 0 0 9 8 15 5 0 0 0 0 8 7 
fish (demersal) 27 7 0 0 0 0 15 12 29 10 0 0 0 0 16 13 


Global food production is shown in megatonnes. Steps include consumption (cons), food waste at the household level (waste), food loss at production (loss), industrial and other demand for 
agricultural products (othr), feed demand (feed), intermediate demand for processing into oils, oil meals and sugar (intr), traded food products (trade; globally, imports equal exports), and 
total production (prod = cons + waste + loss + othr + feed + intr). 
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Extended Data Table 3 | Environmental footprints of food commodities (per weight of product) 


Food item GHG intensity Cropland use Bluewater use Nitrogen use Phosphorus 


(kgCO2/kg) (m2/kg) (m3/kg) (kgN/t) use (kgP/t) 
wheat 0.23 3.36 0.49 28.73 4.39 
rice 1.18 3.51 1.07 36.64 5.20 
maize 0.19 1.98 0.15 22.77 3.57 
other grains 0.29 6.14 0.17 16.36 2.71 
roots 0.07 0.69 0.04 3.63 0.71 
legumes 0.23 11.02 0.95 0.00 0.00 
soybeans 0.12 3.95 0.14 2.75 5.88 
nuts & seeds 0.71 6.39 0.43 14.27 2.11 
vegetables 0.06 0.49 0.09 9.55 1.67 
fruits (temperate) 0.08 1.18 0.33 12.73 1.91 
fruits (tropical) 0.09 0.94 0.32 10.27 1.58 
fruits (starchy) 0.11 0.85 0.12 6.26 1.07 
sugar crops 0.02 0.15 0.11 2.03 0.35 
oil crops 0.46 5.45 0.31 31.33 5.61 
palm crop 0.38 0.63 0.00 4.57 0.73 
sugar 0.19 1.67 1.22 22.34 3.84 
palm oil 1.85 3.10 0.00 22.33 3.57 
vegetable oil 0.67 10.31 0.47 42.73 11.47 
beef 32.49 4.21 0.22 27.29 5.36 
lamb 33.02 6.24 0.49 27.51 4.94 
pork 2.92 6.08 0.35 51.52 8.87 
poultry 1.41 6.59 0.40 50.20 9.02 
eggs 1.58 6.86 0.44 51.22 8.81 
milk 1.22 1.34 0.08 6.32 1.58 
shellfish 0.07 0.36 0.03 3.35 0.81 
fish (freshwater) 0.30 1.51 0.10 16.78 3.62 
fish (demersal) 0.02 0.12 0.01 1.20 0.29 
fish (pelagic) 0.00 0.00 0.00 0.00 0.00 


Footprints for animal products represent feed-related impacts, except for GHG emissions of livestock, which also have a direct component. Cropland use does not include grassland use and the use of 
grass inputs for ruminants. Footprints for fish and seafood represent feed-related impacts of aquaculture production weighted by total production volumes. Displayed are global averages; the regional 
ordering between food items can differ by region. 
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Extended Data Table 4 | Reductions in environmental footprints (as percentages) resulting from technological changes by food group 


eaten GHG emissions Cropland use Bluewater use pale ata la 
tech tech+ tech tech+ tech tech+ tech tech+ tech tech+ 
wheat -9.9 -13.8 -31.5 -37.4 -38.6 -38.6 -4.6 -33.2 -15.8 -57.9 
rice -22.4 -27.6 -25.1 -26.7 -17.2 -17.6 0.7 -29.5 -8.7 -54.3 
maize -9.7 -12.5 -32.6 -36.8 -24.6 -24.7 -10.7 -37.5 -17.8 -58.9 
other grains -10.5 -15.6 -39.6 -37.3 -27.1 -27.2 5.9 -25.9 -13.6 -56.8 
roots 0.0 0.0 -32.4 -43.7 -27.5 -28.0 0.0 -30.0 0.0 -50.0 
legumes -8.8 -12.6 -38.1 -49.9 -32.3 -32.4 0.0 0.0 0.0 0.0 
soybeans -9.3 -12.4 -19.6 -2.9 10.1 9.8 0.0 -30.0 0.0 -50.0 
nuts & seeds 0.0 0.0 -23.9 -35.2 -12.8 -12.8 0.0 -30.0 0.0 -50.0 
vegetables 0.0 0.0 -35.4 -47.1 -39.3 -39.6 0.0 -30.0 0.0 -50.0 
fruits (temperate) 0.0 0.0 -18.2 -45.5 -25.5 -25.8 0.0 -30.0 0.0 -50.0 
fruits (tropical) 0.0 0.0 -35.9 -50.0 -46.1 -46.1 0.0 -30.0 0.0 -50.0 
fruits (starchy) 0.0 0.0 -40.4 -57.7 -25.1 -25.1 0.0 -30.0 0.0 -50.0 
sugar 0.0 0.0 -20.5 -21.2 -26.1 -26.1 0.0 -30.0 0.0 -50.0 
palm oil 0.0 0.0 -22.4 -23.8 -59.1 -59.1 0.0 -30.0 0.0 -50.0 
vegetable oil -1.2 -1.5 -18.5 -43.1 -4.7 -4.9 0.0 -30.0 0.0 -50.0 
beef -9.1 -10.7 -31.4 -34.4 -25.6 -25.6 -2.2 -31.5 -13.2 -56.6 
lamb -8.6 -10.2 -35.7 -42.3 -23.9 -24.0 -1.6 -31.1 -13.2 -56.6 
pork -11.8 -15.5 -29.6 -32.6 -24.7 -25.2 -9.3 -36.5 -15.3 -57.6 
poultry -11.0 -13.6 -31.6 -34.3 -26.0 -26.1 -8.8 -36.1 -16.6 -58.3 
eggs -12.5 -15.4 -30.6 -35.3 -26.5 -26.9 -12.5 -38.8 -17.7 -58.8 
milk -9.8 -12.0 -26.3 -34.5 -16.4 -16.5 -0.2 -30.1 -5.6 -52.8 
shellfish -10.1 -12.9 -22.7 -35.3 -24.0 -24.2 -2.0 -31.4 -4.9 -52.5 
fish (freshwater) -4.7 -6.2 -22.5 -37.2 -18.9 -19.0 -3.9 -32.7 -5.6 -52.8 
fish (demersal) -5.9 -7.8 -22.5 -35.5 -22.7 -22.9 -3.8 -32.7 -5.4 -52.7 
fish (pelagic) 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 


Technological changes include changes of medium ambition (tech) and changes of high ambition (tech+). Zero entries indicate where no data were available to infer potential improvements, and for 
pelagic fish reflect a production method (marine fishing) that does not require feed inputs. 
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Extended Data Table 5 | Food-based dietary recommendations for healthy, more plant-based (flexitarian) diets 


Food item 


wheat 

rice 

maize 

other grains 


minimum level maximum level 


g/d serving g/d serving 
A total of up to 
860 kcal/d for 3-4 


energy 
balance for al 
staple crops 


| (1/3 of energy) 


roots 
legumes 50 1/2 
soybeans 25 1/4 
nuts & seeds 50 2 
vegetables 300 3-4 
fruits 200 2-3 
sugar 31 5% of energy 
palm oil 6.8 1 
vegetable oil 80 1/3 of energy 
beef A total of 14 
lamb g/d for all red 1/7 
pork meat 
poultry 29 1/2 
eggs 13 1/5 
milk 250 1 
Biel A total of 28 
fish (freshwater) g/d for all 

g 1/2 
fish (demersal) fish and 

seafood 


fish (pelagic) 


ARTICLE 


The recommendations include recommended minimum and maximum intakes expressed as weight or calories, and servings. Fish and seafood can be replaced by plant-based foods (legumes, 
soybeans, nuts and seeds, fruits and vegetables) in vegetarian diets. Units are g or kcal per day. 
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Extended Data Table 6 | Decomposition of impacts of dietary scenarios 


Change in environmental impacts by food group 


Indicator Diet scenario nuts& _ fruits& ; animal 
total staples legumes eeeds veg veg oils sugar products 

GHG HGD(E=BMK) -2,513. 113 -4 Z 43 3 30. -2,631 
emissions HGD 2,850  -224 -4 1 43 8 -30  -2,631 
(MtCO2-eq) FLX 5,063  -497 35 28 59 e -30  -4,651 
HGD(E=BMk) 919 1,596 0 0 450 0 -257 -870 

een HGD -1,540 — -864 0 0 450 0 -257 ~~ -870 
FLX 2,307 -2,340 1,092 407 486 716 -257. -2,415 

HGD(E=BMk) 201 +~=—«-227 0 0 244 215 56 

gir wae HGD 136-113 0 0 245 -215 56 
FLX -332--394—S ‘110 39-220 48-214. -143 

Nitrogen  HGD(E=BMK) 3,587 10,782 0 0 2,827 1 -3,322 -6,703 
application HGD 14,784 -7,537 0 0 2,811 4 -3,324 -6,719 
(GgN) FLX -29,723 -17,082 241 672 5,081 1,660 -3,327 -16,935 
“Phosphorus HGD(E=BMK) 324 1,719 0 0 435 OO  -570 -1,261 
application HGD 2,542 -1,136 0 0 432 1-571 -1,264 
(GgP) FLX 4,464 -2,670 416 118 856 607 -571_ -3,212 


Impacts (shown as absolute changes with respect to the baseline projection in 2050) are decomposed into changes by food group and energy intake. In the (E = BMK) scenario, only dietary 
composition is changed, whereas in the main scenarios, dietary composition and energy intake are changed in line with dietary guidelines and current evidence on healthy eating. 
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Extended Data Table 7 | Derivation of planetary-boundary values of the food system 


Planetaly Motivation 

boundary 

Further increasing GHG emissions 
increase climate-related risks to 
ecosystems and cultures, e.g. from sea- 
level rise and increased occurrence of 
extreme weather events, such as heat 
waves, extreme precipitation, and coastal 
flooding °”. 


Climate change 


Method 


Food-related GHG emissions 
in line with limiting global 
warming to below 2 degrees 
Celsius °° with uncertainty 
derived from a model 
comparison of integrated 
assessment models *°. 


Boundary 


A budget of 4.7 (4.3-5.3) 
GtCO,-eq of food-related 
GHG emissions, including 
methane and nitrous oxide, 
but excluding carbon dioxide 
in line with IPCC 
methodology. 


Further increasing the amount of 
agricultural land through deforestation 
could impact the functioning of 
ecosystems °, release large amounts of 
carbon dioxide ', and diminish habitat for 
wild species and thereby pose major 
threats to biodiversity *. 


Land-system 
change 


Analysis of conservation levels 


for each forest biome in line 
with preserving ecosystem 
integrity, scaled up to a global 


value ' and related to croplan 


use 23:39. 


Not increasing pressures on 
forests by keeping global 
cropland use at 12.6 (10.6- 


14.6) Mkm?. Converting 


d productive grazing land into 


cropland can relax the 
boundary value. 


Further depletion and overexploitation of 
groundwater resources impairs natural 
streamflow, wetlands and related 
ecosystems, and can lead to land 
subsidence and salt-water intrusion in 
deltaic areas ° and, eventually, to 
cascading impacts on the global 


hydrological cycle w 


Freshwater use 


Basin-level assessments of the Maintaining environmental 


environmental flow 
requirements of river systems 
12.20 scaled to agricultural 
bluewater use °°. 


flow requirements by limiting 
agricultural bluewater use to 
1,980 (780-3,190) km? or 
below. 


Agricultural runoff from overapplication of 
fertilizers leads to eutrophication, an 
increase in chemical nutrients in the water 
7.9 which in turn can lead to excessive 
blooms of algae that deplete underwater 
oxygen levels resulting in so-called dead 
zones in coastal oceans °. 


Bio-geochemical 
flows of nitrogen 
and phosphorus 


IPCC, Intergovernmental Panel on Climate Change. Data were obtained from previous studies 


Analysis of eutrophication risk 


based on nitrogen and 


phosphorus pollution estimates |jimiting nitrogen and 


of agricultural runoff and 
ecological thresholds "°, with 
an upper value in line with re- 
balancing of application 
between over and under- 
applying regions **. 


1,3-9,12,19,20,33,39,58,63,77,82 
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phosphorus application from 
fertilizers to 69 (52-113) TgN 
and 16 (8-17) TgP 
respectively. 
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Functional genomic landscape of acute 
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The implementation of targeted therapies for acute myeloid leukaemia (AML) has been challenging because of the complex 
mutational patterns within and across patients as well as a dearth of pharmacologic agents for most mutational events. 
Here we report initial findings from the Beat AML programme on a cohort of 672 tumour specimens collected from 562 
patients. We assessed these specimens using whole-exome sequencing, RNA sequencing and analyses of ex vivo drug 
sensitivity. Our data reveal mutational events that have not previously been detected in AML. We show that the response 
to drugs is associated with mutational status, including instances of drug sensitivity that are specific to combinatorial 
mutational events. Integration with RNA sequencing also revealed gene expression signatures, which predict a role for 
specific gene networks in the drug response. Collectively, we have generated a dataset—accessible through the Beat 
AML data viewer (Vizome) —that can be leveraged to address clinical, genomic, transcriptomic and functional analyses 


of the biology of AML. 


Approximately 21,000 people are diagnosed with AML and over 10,000 
AML-related deaths are reported annually in the United States’. 
Cytogenetic and sequencing analyses have revealed at least 11 genetic 
classes of AML? and over 20 subsets can be assigned when also con- 
sidering cell differentiation states of the leukaemic blasts*°. Deep 
sequencing of AML by The Cancer Genome Atlas (TCGA) revealed 
a heterogeneous disease with nearly 2,000 somatically mutated genes 
observed across 200 patients®. Many of the recurrent cytogenetic 
events and somatic mutations have been shown to carry prognostic 
importance®”®, Some of the most frequent somatic variants can also 
be observed in myelodysplastic syndromes and myeloproliferative neo- 
plasms”! that can transform into AML. These same mutations have 


also been observed in healthy individuals who have age-related clonal 
haematopoiesis, which is associated with significant risk for the devel- 
opment of myelodysplastic syndromes, myeloproliferative neoplasms 
and AML'*}°. 

A small number of therapies targeted to mutational events have 
been developed for patients with AML, although the current standard 
of care remains largely unchanged over the past 30-40 years. The 
first targeted therapy for AML involved use of all-trans retinoic acid 
in combination with arsenic trioxide for patients with rearrangement 
of the retinoic acid receptor’®'”. More recently, fms-related tyrosine 
kinase 3 (FLT3) inhibitors have been developed for FLT3 mutational 
events that occur in approximately 20-30% of patients with AML!*7!. 
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Fig. 1 | Comparative genomic landscape of AML. a, Frequency of the 
33 mutational events that were cumulatively the most frequent in Beat 
AML (n = 531 patients) and TCGA (n = 200 patients) datasets. Top, the 
full Beat AML cohort; middle, only the de novo Beat AML cases; bottom, 
de novo cases in the TCGA. Mutations were summarized by gene as was 
done by TCGA, whereas the FLT3-ITD mutations were kept separate in 
the rest of this manuscript. b, Mutational events at 2% frequency or less 
in the de novo cases of Beat AML and TCGA were compared for overlap. 
Venn diagram displays the overlap with the small circles within each 


When FLT3 inhibitors were used as single agents, responses of only 
2-6 months were obtained””**. Midostaurin, a broad-spectrum FLT3 
inhibitor, was recently approved for use in newly diagnosed patients 
with AML with FLT3 mutations, in combination with standard of care 
chemotherapy”®; however, relapse was still prevalent in this setting. 
Targeting of a mutant form of isocitrate dehydrogenase (NADP(+)) 
1 and 2, cytosolic (IDH1 and IDH2)”’, has shown clinical benefit 
leading to approval of the IDH2 inhibitor, enasidenib, and the IDH1 
inhibitor, ivosidenib”*”?. Additional proposed strategies have included 
inhibition of epigenetic modifiers, such as enhancer of zeste 2 poly- 
comb repressive complex 2 subunit (EZH2)*°, lysine demethylase 
1A (KDM1A)*!, and DOT1-like histone lysine methyltransferase 
(DOT1L)*, based on the direct mutation of these targets or synthetic 
lethality in the context of drug combinations (all-trans retinoic acid 
and KDM1A inhibitors) or specific genetic features (lysine methy]l- 
transferase 2A (KMT2A)-gene rearrangement for DOT1L inhibitors). 
Hypomethylating agents have been used in patients with AML, for 
which better responses have been reported for certain genetic subsets, 
such as those with mutations in TET2°° or tumour protein 53 (TP53)**. 
Most recently, an inhibitor of the BCL2 apoptosis regulator (BCL2), 
venetoclax, showed an approximately 20% response rate when used 
as a single agent in patients with a relapse** and higher response rates 
(around 60%) were reported in combination with hypomethylating 
agents in newly diagnosed, elderly patients with AML**. 


Comparative genomic landscape of AML 
To better understand genetic or transcriptional markers and mecha- 
nisms of drug sensitivity and resistance in AML, we developed a cohort 
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compartment representing a size-scaled frequency of each mutational 
event. c, Analysis as in b with only the non-de novo Beat AML cases versus 
TCGA. d, Co-occurrence or exclusivity of the most recurrent mutational 
events in the Beat AML cohort (n = 531 patients) were assessed using the 
DISCOVER" method. The dot plot shows the odds ratio of co-occurrence 
(blue) or exclusivity (red) using colour-coding and circle size as well as 
asterisks that indicate FDR-corrected statistical significance. *P < 0.1; 

**D < 0.05; ***P < 0.01. 


of 672 primary specimens from 562 patients with AML and we per- 
formed extensive functional and genomic analyses on these samples. 
Detailed clinical annotations, including diagnostic information, clinical 
laboratory values, treatments, responses and outcomes were curated 
from electronic medical records and are reported in Supplementary 
Tables 1-5. 

We performed exome sequencing on 622 of the specimens from the 
cohort representing 531 different patients. The final, high-confidence 
variant list (Supplementary Table 7) revealed a range of 1-80 somatic 
variants per patient (cohort median of 13 somatic variants) (Extended 
Data Fig. 1). Comparison of the top 33 most commonly mutated genes 
across Beat AML and TCGA® showed generally similar frequencies. 
Higher frequency of mutations in serine- and arginine-rich splicing fac- 
tor 2 (SRSF2) were seen in Beat AML than in TCGA, and this difference 
was conserved when only the de novo cases in Beat AML and TCGA 
were compared (Fig. la). By contrast, mutational events that were seen 
with a frequency of less than 2% across Beat AML and TCGA were 
much more divergent; variants in 221 mutant genes were called in both 
datasets, 939 mutant genes were called only in TCGA and around 1,500 
mutant genes were called only in Beat AML, irrespective of whether 
we compared only de novo or non-de novo cases (Fig. 1b, c). Most of 
these divergent mutational events were observed only in single patients; 
however, there were mutations in 11 genes that were called in 1% or 
more of patients in Beat AML, but were not observed in previous AML 
sequencing studies (Extended Data Fig. 1). Finally, co-occurrence and 
exclusivity of the most frequent variants were computed and reveal 
significant patterns of mutational co-segregation, suggesting biological 
cooperation between certain mutational events (Fig. 1d). 


25 OCTOBER 2018 | VOL 562 | NATURE | 527 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


a - c 
Sunitinib + NPM1, FLT3-ITD e e 
KW-2449 + FLT3-ITD 
7" 250 = 
Crenolanib + FLT3-ITD| ° ere td 
es 
Sorafenib + NPM1, FLT3-ITD e 
9 
104|Sorafenib + FLT3-ITD re) e ) 
_ % 200 2 Inhibitors 
x = 
ind Patients = . e petals 
3 ( o20 ele @ JAK inhibitor | 
4 ve a > ae 
3 ‘fPazopanio (GW786034) + NRAS) © a. 6,1 eo & —— & Ruxoltinib (NCBO18424) 
& _ {Tivozanib (AV-951) + NRAS] © 80 S$ e e e Tofacitinib (CP-690550) 
2 5 { +5 
: © @{Vemurafenib (PLX-4032) + NRAS 100 
Poe 
NVP-TAE684 + KRAS = P = 0.0168 
Crizotinib (PF-2344066) + KRAS 50 
= VS oP gh 
Kr er 
0 & cs ~ ® 
cP ee fom 
100 = -50 0 50 100 g Fe 
Average difference 
b 
Entospletinib Ibrutinib 
P=0.102 P= 0.0127 P=0.0112 P=0.000262 P=1.28x10% P=0.998 P=7x10- 
300 P = 0.00174 P=0.635 P=5.63x 10-5 P= 0.235 P=0.00143 P=0.000404 P=5.53x 10-6 
e oe 
g é s : e 4 & : id : e 
S 200 2 : : & $ S || 3 
=< 3 e é 3 ry 8 * F ° 
3 é e ° . 3 3 
e é = é . ée e e 
100 es ‘ ¢ 2 e e 
e e e e 
2 6 €& 3 26 3 8 38 2 p & § @ 8 8 38 
rs < 7 < ro < r cz 
> = e 2 had 2 2 2 S = o 2 ° 2 2 = 
@ 2 ry J > ~ - ® = 5 5 : = = 
2 fa) irs cz = fa) fa) Fd fay z irs x= fa) a 
< & & & < £€ &F & 
eB = € § e = € & 
s fa) z in s ra) m1 ire 
fay s a x 
5 5 
= = 
fay a 


Fig. 2 | Integration of genetic events with drug sensitivity. a, Average 
difference in AUC drug response between mutant and wild-type cases was 
determined using a Student’s two-sided t-test from a linear model fit 

(x axis). The P values were corrected using the Benjamini-Hochberg 
method over all the drugs (y axis). The number of samples used to 
correlate each mutational event with drug sensitivity is reported in the 
Supplementary Table 17. Expanded and interactive plots are available in 
our online data browser (http://www.vizome.org/ and http://vizome.org/ 
additional_figures_BeatAML.html). b, AUCs for ibrutinib or entospletinib 
(n = 277 or 168 patient samples, respectively) were plotted for cases 

with single, double or triple mutations in NPM1 and DNMT3A as well as 


Drug response and gene expression 

RNA sequencing was performed on 451 specimens from 411 patients. 
Clustering of the 2,000 most variably expressed genes across the cohort 
revealed gene expression signatures that were associated with many 
of the prominent genetic and cytogenetic disease groups (Extended 
Data Fig. 2). To understand the profile of sensitivity and resistance to 
a variety of small-molecule inhibitors, we profiled primary tumour 
cells from 409 specimens derived from 363 patients against a panel of 
122 small-molecule inhibitors using an ex vivo drug sensitivity assay*”. 
Drug sensitivity patterns were analysed with respect to clinical and 
genetic features of tumours (Extended Data Fig. 3). We compared the 
average area-under-the curve (AUC) values for each drug between 
samples from cases with de novo AML and cases with AML that had 
transformed from myelodysplastic syndromes or myeloprolifera- 
tive neoplasms, through a series of single-factor analysis of variance 
(ANOVA) tests. Generally, transformed cases showed less sensitivity 
than de novo cases to most drugs. Of the 122 drugs tested, 64 were 
significantly (false discovery rate (FDR) < 0.1) more sensitive in the 
de novo samples, whereas only one drug—panobinostat (an HDAC 
inhibitor)—was significantly more sensitive in the transformed cases 
(Extended Data Fig. 4). In addition, we analysed the concordance of 
drug sensitivity patterns with respect to predicted target gene, gene 
family or pathway for each drug (drug assignments to target families 
can be found in Supplementary Table 11). This analysis revealed drug 
targets and/or drug families that were highly concordant among con- 
stituent members, as well as drug families that were quite discordant 
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FLT3-ITD. Data are mean + s.d. An ANOVA was conducted using the 
Bonferroni approach (statistical results and sample size for all groups 
are reported in Supplementary Tables 18, 19). c, Inhibitors of JAK family 
kinases were assessed for activity against cases with BCOR mutations alone 
or BCOR mutations in combination with mutations in SRSF2, RUNX1 

or DNMT3A. The AUC values are plotted per case; data are mean + s.d. 
There was a significant difference in AUC; two-sided Student's t-test 

(ta2 = —2.489, P = 0.0168, 95% confidence interval —73.018 to —7.643) 
between mutations in BCOR and RUNX1 (n = 16) versus the average of 
BCOR mutations alone (n = 16), mutations in BCOR and SRSF2 (n = 8), 
and mutations in BCOR and DNMT3A (n = 4). 


(Extended Data Fig. 5). To create a global view of overall sensitivity or 
resistance for each case, we generated a heat map of binary sensitive or 
resistant calls for each sample to each drug. We then annotated the sen- 
sitive or resistant fraction of each case against the European Leukaemia 
Net (ELN) 2017° (Extended Data Fig. 6a) and WHO (World Health 
Organization) 2016+ (Extended Data Fig. 6b) classifications. 


Gene signatures of drug responses 

We performed a cohort analysis to assess the correlation between drug 
sensitivity patterns and mutational events or gene expression levels. 
Correlation analyses between drug sensitivity and mutational events 
were performed by assessing the range of sensitivity of cases with a 
mutation in an individual gene (as well as co-occurring mutational 
events) versus cases with the wild-type sequence for that same gene. 
Broad summaries of full cohort results are displayed in Circos and 
Manhattan plots (Extended Data Figs. 7a, 8). Individual associations 
between drugs and mutations are displayed as a Volcano plot, in which 
the differences in drug sensitivity between mutant and wild-type genes 
and the FDR-corrected significance values*® are plotted (Fig. 2a). 
Some of the associations with the highest levels of statistical signifi- 
cance involved FLT3 internal tandem duplications (FLT3-ITD) and 
these showed sensitivity to FLT3 pathway inhibitors, which serves as 
a proof-of-principle as FLT3 inhibitors are known to be more effec- 
tive against FLT3-ITD AML. However, to reveal associations between 
drugs and mutations that were not biased by the co-occurrence with 
FLT3-ITD, we also plotted the same analysis using only cases that had 
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Fig. 3 | Integration of gene expression and drug sensitivity patterns. 
Differential gene expression signature distinguishing the 20% most 
ibrutinib-sensitive (n = 46) from the 20% most resistant (n = 44) 
specimens. Heat maps for all other drugs are available in our online data 


wild-type FLT3 (Extended Data Fig. 7b). We also created Volcano plots 
that were specific to each individual drug (versus all mutational events) 
and for each individual gene (versus all drugs tested). All Volcano 
plots can be found with interactive features in our online data browser 
(http://www.vizome.org/ and http://vizome.org/additional_figures_ 
BeatAML.html). 

Mutations in several genes, most notably TP53 or ASXL transcrip- 
tional regulator 1 (ASXL1), were shown to cause a broad pattern of drug 
resistance. Notably, a few drugs trended to be more sensitive to cases 
with TP53 mutations (such as elesclemol) or with mutations in ASXL1 
(such as panobinostat), suggesting candidates for further exploration of 
cases with AML that have these poor prognostic features. Mutations in 
the NRAS proto-oncogene, GTPase (NRAS) or KRAS proto-oncogene, 
GTPase (KRAS) also correlated with resistance to most drugs, although 
mutations in these genes did show the predicted sensitivity to MAPK 
inhibitors. Of particular note, there was a stronger association between 
mutations in NRAS and MAPK sensitivity than between mutations 
in KRAS and MAPK sensitivity. [DH2 mutations conferred sensitivity 
to a broad spectrum of drugs, whereas mutations in IDH1 conferred 
resistance to most drugs. Mutations in RUNX1 correlated with sen- 
sitivity to PIK3C and mTOR inhibitors (such as BEZ235) and to the 
multi-kinase vascular endothelial growth factor receptor (VEGFR) 
inhibitor, cediranib. Mutation in the splicesome components U2 small 
nuclear RNA auxiliary factor 1 (U2AF1) and zinc-finger CCCH-type, 
RNA-binding motif and serine/arginine rich 2 (ZRSR2) correlated 
with sensitivity to several drugs. The mechanisms that underlie these 
latter sensitivity correlations (and many others in the dataset) merit 
further investigation. A significant association was seen between muta- 
tions in FLT3, NPM1 and DNMT3A and sensitivity to the Food and 
Drug Administration (FDA)-approved drug, ibrutinib. Because these 
mutations exhibit a significant pattern of co-occurrence, we next exam- 
ined every combination of single, double or triple mutant genes with 
respect to ibrutinib. We observed that DNMT3A alone or DNMT3A 
and FLT3 double-mutant cases were not significantly different from 
cases with wild-type genes, whereas cases with FLT3-ITD alone or any 
combination with a mutation in NPM1 (including cases in which all 


browser (http://vizome.org/additional_figures_BeatAML.html). For the 
number of samples used to correlate each drug with gene expression, see 
Supplementary Table 17. RP11-333E13.2 is a lincRNA; RP11-196G18.24 is 
a pseudogene. 


three genes were mutated) were significantly more sensitive than cases 
with wild-type genes (Fig. 2b). Ibrutinib is an inhibitor of BTK and 
TEC family kinases, although it can exhibit broad off-target effects 
when maintained in continuous culture with target cells. We noted 
that another kinase inhibitor with high specificity for spleen-associated 
tyrosine kinase (SYK)—entospletinib—showed a similarly significant 
pattern of sensitivity in cases with FLT3-ITD and mutations in NPM1 
(Fig. 2b), potentially pointing to an operationally important target for 
this disease subset. Indeed, previous studies have suggested that SYK 
is an interacting target of FLT3-ITD in AML”. Finally, we performed 
an additional analysis that leveraged multiple inhibitors with common 
targets to see whether this approach could identify additional associ- 
ations. We focused on correlations between the four selective Janus 
kinase (JAK) inhibitors in our drug panel (momelotinib, ruxolitinib, 
tofacitinib and JAK inhibitor I) and mutations in the BCL6 corepres- 
sor (BCOR) alone or mutations in BCOR together with mutations in 
DNMT3A, RUNX1 or SRSF2. By plotting the average difference of each 
JAK inhibitor between mutant and wild-type groups for these four cate- 
gories and performing a one-way ANOVA of the four groups, we found 
that mutations in both BCOR and RUNX1 correlated with increased 
sensitivity to all four JAK kinase inhibitors, whereas BCOR mutations 
alone or mutations in BCOR together with mutations in DNMT3A or 
SRSF2 showed no difference in sensitivity to the JAK kinase inhibitors— 
although BCOR mutations alone did show sensitivity to other drugs, 
such as the tankyrase/WNT inhibitor XAV-939 and the multi-kinase 
inhibitor crizotinib (Fig. 2c). Collectively, these data suggest that 
dysregulation of the JAK pathway may represent a vulnerability within 
certain settings of specific combinations of mutations and not in others. 

We also performed an integrative analysis of drug sensitivity data 
with respect to patterns of gene expression, comparing the 20% of 
samples with the lowest AUC versus the 20% with the highest AUC 
and assessing the most differentially expressed genes between those 
sample sets. This analysis revealed significant (FDR < 0.05) expression 
signatures for 78 testable drugs in the panel (78 out of 119; 65.5%). 
As an example, the 20% most and least sensitive cases to ibrutinib 
could be clearly distinguished by an expression signature of 17 genes 
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(Fig. 3 and Extended Data Fig. 9). Expression-signature heat maps for 
each drug can be found in our online data browser (http://vizome.org/ 
additional_figures_BeatAML.html). 

Finally, to assess the joint contributions of both mutation and system- 
level co-expression patterns (based on de novo network inference) 
to predicting drug response, multivariate modelling was performed. 
This integrated analysis allows us to move beyond the significant 
associations of single mutations (such as FLT3-ITD and mutations 
in NPM1). We performed a weighted correlation network analysis of 
RNA-sequencing data that identified 14 sets of genes for which the gene 
expression patterns showed significant clustering across the cohort 
(clusters contained both increased and decreased gene expression 
events). We performed regularized regression modelling (LASSO)“° to 
understand how strongly any mutational event or any of these 14 gene 
expression clusters correlated with sensitivity or resistance to any of 
the drugs on the panel. We identified numerous, novel co-occurrences 
of mutations and expression clusters that were associated with drug 
sensitivity or resistance, with co-occurrences with ibrutinib shown 
as an example in Extended Data Fig. 9. For ibrutinib, these co- 
occurrences included a co-expression cluster of 345 genes (using a 
colour-labelling scheme and shown in ‘turquoise’) that correlated with 
drug sensitivity and frequently co-occurred with FLT3-ITD, which also 
correlated with drug sensitivity. There was significant overlap between 
this ‘turquoise’ gene expression cluster and the 17-gene signature in 
Fig. 3 (indicated by the ratio of observed overlap to expected over- 
lap, or the representation factor, which was 13.6; P < 1.734 x 107%). 
It is important to note that network analysis of this gene expression 
cluster highlighted enrichment of a number of immune-related path- 
ways, which was not detected within the 17-gene signature (displayed 
at http://vizome.org/additional_figures_BeatAML.html). We also 
identified a 110-gene subnetwork (labelled in ‘magenta’ in Extended 
Data Fig. 9), which was associated with resistance to ibrutinib and was 
significantly associated with adverse ELN 2017 risk. To look more 
broadly at associations between mutations and gene expression clus- 
ters, we summarized drugs at family level to assess the frequency with 
which mutations and gene expression clusters were selected in itera- 
tive regression modelling (displayed at http://vizome.org/additional_ 
figures_BeatAML.html). 


Discussion 

In summary, we report a large functional genomic dataset of primary 
tumour biopsies. We present a cohort of specimens from patients 
with AML for which we have performed detailed clinical annotations, 
genomic and transcriptomic analyses and ex vivo drug sensitivity studies, 
and we provide analytical approaches for data integration. Each of these 
datasets alone has revealed new information about the biology and 
potential translational approaches in AML, and the integration of these 
datasets has revealed new markers and mechanisms of drug sensitivity 
and resistance that merit further study. These data have all been made 
publicly available through the NIH/NCI dbGaP and Genomic Data 
Commons (GDC) resources, and we have developed tools to facilitate 
user-interfacing with the dataset (http://www.vizome.org/). We hope 
and expect that this public data release will stimulate further use of 
the data, such that novel findings can be derived and turned into new 
clinical approaches for treatment of AML. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
https://doi.org/10. 1038/s41586-018-0623-z. 
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METHODS 


Patient samples. All patients gave informed consent to participate in this study, 
which had the approval and guidance of the Institutional Review Boards (IRB) 
at Oregon Health & Science University (OHSU), University of Utah, University 
of Texas Medical Center (UT Southwestern), Stanford University, University of 
Miami, University of Colorado, University of Florida, National Institutes of Health 
(NIH), Fox Chase Cancer Center and University of Kansas (KUMC). Samples were 
sent to the coordinating centre (OHSU; IRB 9570; 4422; NCT01728402), where 
they were coded and processed. Specific names of centres associated with each 
specimen were coded and centres providing less than five samples were aggregated 
together and given one centre identifier. Mononuclear cells were isolated by Ficoll 
gradient centrifugation from freshly obtained bone marrow aspirates or periph- 
eral blood draws. Cell pellets were snap-frozen in liquid nitrogen for subsequent 
DNA isolation (Qiagen, DNeasy Blood & Tissue Kit), freshly pelleted cells 
were lysed immediately in guanidinium thiocyanate (GTC) lysate for subsequent 
RNA isolation (Qiagen, RNeasy Mini Kit), and freshly isolated mononuclear 
cells were plated into an ex vivo drug sensitivity assays within 24 h (described 
in ‘Ex vivo functional drug screens’). Skin punch biopsies were collected at the 
site of Jamshidi needle insertion for subsequent bone marrow biopsies and 
genomic DNA was isolated for use as matched normal controls for exome sequencing 
(Qiagen, DNeasy Blood & Tissue Kit). Clinical, prognostic, genetic, cytoge- 
netic and pathologic laboratory values as well as treatment and outcome data 
were manually curated from the electronic medical records of the patient. 
Patients were assigned a specific diagnosis based on the prioritization of genetic 
and clinical factors as determined by WHO guidelines. To prevent re-identification, 
any patient over the age of 90 was placed into a >90 aggregated age bracket. 
Genetic characterization of the leukaemia samples included results of a clinical 
deep-sequencing panel of genes commonly mutated in haematologic malignancies 
(Sequenome and GeneTrails (OHSU); Foundation Medicine (UTSW); Genoptix; 
and Illumina). 

Whole-exome sequencing and custom-capture validation sequencing. For 
whole-exome sequencing, [lumina Nextera RapidCapture Exome capture probes 
and protocol were used, which provided coverage of 37 Mb of genomic DNA- 
coding regions. In brief, following initial quality control on a TapeStation (Agilent), 
50 ng of intact genomic DNA was fragmented and tagged (tagmentation) in a single 
step. Following clean-up, the tagmented DNA was amplified by 10 cycles of PCR, 
which added the indexed adaptors for clustering and sequencing. Libraries were 
hybridized to capture pools in 12 sample sets with two rounds of hybridization 
performed to increase specificity. Libraries recovered with streptavidin magnetic 
beads were amplified by 10 cycles of PCR, unincorporated reagents were removed 
with AMPure beads (Agencourt), and validated on the TapeStation. Quantification 
of capture pools was done using real-time PCR (Kapa). Libraries were denatured, 
flow cells set up using the cBot (Illumina), and run on a HiSeq 2500 using paired- 
end 100-cycle protocols. For the AML tumour samples, 5 or 6 lanes were run per 
capture group. For the matched skin biopsy samples, 3 lanes (or equivalent) were 
run per capture group. The instrument and chemistry for all capture groups are 
provided in Supplementary Tables 12, 13. 

For validation of sequencing results, an 11.9-Mb custom-capture library was 

developed that provided coverage of all variants previously reported in AML as 
well as all new variants detected from exome sequencing in this study (genes, var- 
iants and capture regions for this custom library can be found in Supplementary 
Table 14). This capture library was then applied to sequence 96 specimens that had 
previously been subjected to whole-exome sequencing for validation of variant 
calls. 
Whole-exome sequencing data processing. We developed customized analyt- 
ical pipelines that combined published algorithms with novel filtering, curation 
and quality-control steps. Detailed depictions of analytical workflows can be 
found in Supplementary Table 6 and on our online browser (http://vizome.org/ 
additional_figures_BeatAML.html). Initial data processing and alignments were 
performed with commonly used analytical tools. For each flowcell and each sample, 
the FASTQ files were aggregated into single files for read 1 and 2. During this pro- 
cess, these reads were trimmed by 3 on the 5’ end and 5 on the 3’ end. BWA MEM 
version 0.7.10-1789” was used to align the read pairs for each sample-lane FASTQ 
file. As part of this process, the flowcell and lane information was kept as part of 
the read group of the resulting SAM file. The Genome Analysis Toolkit (version 
3.3) and the bundled Picard (version 1.120.1579) were used® for alignment post- 
processing. The files contained within the Broad’s bundle 2.8 were used including 
their version of the build 37 human genome (these files were downloaded from 
ftp://ftp.broadinstitute.org/bundle/2.8/b37/). The following steps were performed 
per sample-lane SAM file generated for each CaptureGroup: (1) the SAM files were 
sorted and converted to BAM via SortSam; (2) MarkDuplicates was run, marking 
both lane level standard and optical duplicates; (3) the reads were realigned around 
insertions and deletions (indels) from the reads using RealignerTargetCreator/ 
IndelRealigner; (4) base quality score recalibration was performed. 


The resulting BAM files were then aggregated by sample and an additional 
round of MarkDuplicates was performed out at the sample level. Quality-control 
reports were generated using the ReportingTools*! and qrqc*® Bioconductor R 
packages along with sequencing core and alignment output files. Each AML sam- 
ple BAM was paired with its skin biopsy pair and an additional round of indel 
realignment was carried out to ensure consistency of genotypes between the two 
samples. If an AML sample did not have a pair, the indel realignment was instead 
done at the sample level. 

Whole-exome sequencing variant detection. For genotyping, each AML-skin 
biopsy pair was realigned at the sample level and then genotyped for single-nucleotide 
variations using Mutect version 1.1.74° and Varscan2 version 2.4.14”. Indels were 
produced using Varscan2. Each variant call format (VCF) file was annotated using 
the Variant Effect Predictor version 83 against GRCh37“. The resulting VCF files 
were filtered to include only those annotated to a gene and were converted to 
mutation annotation format (MAF) format using the vcf2maf version 1.6.6 tool”’. 

Mutect version 1.1.74° was run using default parameters, except that no limit 
was placed on the number or frequency of the alternative allele frequency in the 
normal condition to help to address normal contamination. 

Varscan2 version 2.4.1” was run in somatic mode with the recommended fil- 
tering scheme”, except as shown in Supplementary Table 23. 

Indels and single-nucleotide variants (SNVs) were produced for the tumour- 
only samples again using Mutect without a specified normal for consistency and 
VarScan2 in mpileup2indel or mpileup2snp mode, respectively. These variants 
were assigned to their most deleterious effect on Ensembl transcripts using 
Ensembl VEP version 83 on GRCh37. This assignment was done using the same 
VEP parameters as the vcf2maf (version 1.5.0) program. 

The TCGA AML variants® in MAF form were downloaded from the GDC 
archive site: https://portal.gdc.cancer.gov/legacy-archive/files/c410d927-d49c- 
4d4f-8356-601bee563ebe. The MAF files were converted to VCF files using the 
vcf2maf suite*!. The resulting VCF files were lifted over from NCBI36 to GRCh37 
of the human genome using CrossMap””. Only those variants that successfully 
lifted over were kept. 

Variants from supplementary table 2 from a previously published study" were 
extracted and further processed, removing variants that were ambiguous in terms 
of external sources and could not be found in their whole-exome sequencing 
variants. The unique set of Beat AML variants was annotated relative to RefSeq 
transcripts using Ensembl VEP similar to above and all consequences were kept. 
This set of variants and consequences was searched against the set of processed 
variants from the previously published study". 

Using the runs from MuTect and VarScanz2, these data were next filtered to 
keep only the protein-influencing SNVs and indels from Mutect and VarScan2 
and filtered, requiring that the variants had at least 5 reads and either not be seen 
in the Exome Aggregation Consortium (ExAC)* dataset or be seen at a frequency 
<0.01. These data present several additional challenges. First, somatic calls can- 
not be obtained directly from the tumour-only samples. Second, there is always 
a possibility of tumour contamination of the skin samples for those samples that 
were paired. To address these issues and maximize comparability, we used an iter- 
ative approach. The following was done separately for the two genotypers. (1) An 
initial set of higher-confidence somatic mutations were retrieved from the paired 
tumour-normal samples requiring tumour variant allele frequency (VAF) > 8% 
and normal VAF < 5%, in addition to the significance tests already performed by 
the programs. (2) A list of all candidate mutations was collated requiring that a 
mutation was either seen in the high-confidence somatic set, the set of variants 
from the previously published study“ or from the lifted-over set of variants from 
the TCGA AML paper®. (3) Mutations from the overall set were kept if: (a) the 
overall number of calls in the paired samples was not more than twice the number 
of high-confidence somatic calls; (b) the tumour-only frequency for the calls was 
less than 50% greater than the number of calls in the paired samples; (c) the muta- 
tion was seen in list of the previous study! or TCGA dataset®. (4) High-confidence 
somatic mutations were kept regardless. 

The data from the two genotypers were combined along with FLT3-ITD calls 
from Pindel™*. Comparing our variant lists from whole-exome sequencing amd 
custom-capture validation sequencing, we noticed—similar to others*’—that low 
allele frequency C-to-A variants (<15%) tended to have poor concordance (7.7%; 
data not shown) between the initial run and the technical validation run. These 
variants were removed in these data, along with a curated ‘blacklist’ (Supplementary 
Table 15) of known problematic variants and/or genes, including mitochondrial 
DNA variants. In addition, all variants that were seen in a cumulative list of 
normal samples from Beat AML at a frequency greater than 1% were removed. 
Cumulatively, of this set, 94% of covered SNVs were validated with 82% of indel 
calls also being confirmed with validation sequencing. Manual review was then 
carried out in the following steps. (1) The addition back of all flagged rows from 
the previous study". (2) The review of all TCGA-flagged rows for VAF pattern that 
matched or did not match with known drivers in the same specimen. Some TCGA 
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variants were added back based on convincing VAF pattern and known pathogenic 
role, other TCGA variants were kept excluded based on a VAF pattern that was 
unlike known drivers in the same specimens. (3) Other variants were added back 
based on other specimens that had the same variant that were still on the include 
list, and if the VAF pattern looked convincing for inclusion. (4) All genes from the 
previous study’ with only frameshift and/or nonsense variants were manually 
reviewed and missense mutations were manually removed. (5) Genes and/or var- 
iants that were on both the include and exclude lists were manually reviewed and 
they were removed if they were C-to-A with over 15% VAE they did not validate 
and/or the VAF pattern was unlike known drivers in same specimen. (6) Further 
review of all genes in the summary sheet with cohort frequency of 8 or more (1% of 
more). Any that were not familiar from knowledge of AML literature were manually 
reviewed for VAF patterns that did or did not match known drivers within the same 
specimens. Those that did not match were manually removed. 

After this manual review, additional curated mutations from the 
UnifiedGenotyper run were added back in along with a curated set of variants from 
tumour-only patients for genes from the previous study". The tumour-only AML 
samples used in another study (H.Z. et al., manuscript submitted) were removed 
and used for that study. 

Detection of internal FLT3-ITD and NPM1 mutations. A subset of samples was 
tested for FLT3-ITD and NPM1 mutation status using an internally run PCR assay 
and capillary electrophoresis. Genomic DNA (gDNA) was extracted from fresh 
blood or bone marrow aspirates of patients with AML and was used to detect 
the presence or absence of FLT3-ITD and NPM1 4-bp insertion mutations**°”. 
Primers for FLT3 spanned approximately 330 bp to include the common inter- 
nal duplication site°®. Primers for NPM1 spanned approximately 170 bp to cover 
the clustered multiple insertional or indel sites*”°*. Primers were HPLC-purified 
by the manufacturer. The multiplex PCR reaction solution” consisted of 100 ng 
gDNA, 10 pmol of the respective forward and reverse primers for FLT3 and 
NPM1, 25 mmol |"! MgCl, 2.5 mmol 1"! dNTPs, 5 xl 10x PCR buffer, 0.2 pl 
AccuTaq DNA polymerase and water in a total volume of 50 jl. The PCR con- 
ditions were: initial denaturing for 5 min at 94 °C, followed by 30 cycles at 94 °C 
for 30 s, 56 °C for 45 s and 72 °C for 30 s with a final cycle of 10 min at 72 °C. The 
PCR products were diluted 1:10 and analysed by capillary electrophoresis on a 
QIAxcel high-resolution DNA cartridge according to the manufacturer's protocol. 
Forward primer FLT3: 5'-AGCAATTTAGGTATGAAAGCCAGCTA-3’; reverse 
primer FLT3: 5’-CTTTCAGCATTTTGACGGCAACC-3’. Forward primer NPM1: 
5/-GTTTCTTTTTTTTTTTTTCCAGGCTATTCAAG-3’; reverse primer NPM1: 
5’-CACGGTAGGGAAAGTTCTCACTCTGC-3’. 

Derivation of FLT3-ITD and NPM1 consensus calls. Consensus FLT3-ITD and 
NPM1 mutation calls found in the clinical summary table (Supplementary Table 5) 
were determined by comparing the internal capillary PCR test (internal; according 
to the methods described in ‘Detection of internal FLT3-ITD and NPM1 muta- 
tions’), the Clinical Laboratory Improvement Amendments/College of American 
Pathology (CLIA/CAP) laboratory run test (Sequenome, GeneTrails, Foundation 
Medicine, Genoptix, Illumina). The internal test was used for the sample consen- 
sus call when available, as it was performed on the exact sample that was used for 
further ex vivo drug sensitivity assays. Where discordance existed between the 
internal test and the CLIA laboratory test results, the sample was flagged for manual 
review. The trace file for the internal test was visually inspected and if discordance 
with the CLIA/CAP test results persisted, the whole-exome sequencing data were 
then used to help to determine the consensus call. 

Derivation of CCAAT enhancer binding protein « (CEBPA) biallelic consen- 
sus calls, N-terminal and C-terminal mutations have been described to occur on 
opposing alleles and patients with CEBPA biallelic mutations have been shown 
to fall into a favourable risk category™. Patients were scored positive for biallelic 
CEBPA mutation if described in the clinical notes as biallelic or double-positive. 
Patients were also scored as CEBPA biallelic if both N-terminal and C-terminal 
mutations were identified in the whole-exome sequencing data. 

RNA sequencing and data processing. All samples were sequenced using the 
Agilent SureSelect Strand-Specific RNA Library Preparation Kit on the Bravo robot 
(Agilent). In brief, poly(A)+ RNA was chemically fragmented. Double-stranded 
cDNAs were synthesized using random hexamer priming with 3’ ends of the 
cDNA adenylated, after which indexed adaptors were ligated. Library amplifi- 
cation was performed using three-primer PCR using a uracil DNA glycosylase 
addition for strandedness. Libraries were validated with the Bioanalyzer (Agilent) 
and combined to run 4 samples per lane, with a targeted yield of 200 million clus- 
ters. Combined libraries were denatured, clustered with the cBot (Illumina) and 
sequenced on the HiSeq 2500 using a 100-cycle paired-end protocol. In addition 
to the AML samples, there was also a sample of purified CD34 molecule (CD34) 
cells from healthy control bone marrow, which was included in each sample group 
(for a total of 12 times sequencing this control RNA). This control served as both 
a healthy control and a quality check on inter-group batch effects. In addition, 
21 individual healthy bone marrow samples were also included, two of which were 
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CD34-selected (17-00053 and 17-00056) with the other 19 being whole mono- 
nuclear bone marrow cells from healthy donors. 

Workflows for processing and analysis of RNA-sequencing data to generate gene 
counts and gene fusions for each sample are shown on our online browser (http:// 
vizome.org/additional_figures_BeatAML.html) with processed gene expression 
values for each specimen listed in Supplementary Tables 8, 9. For each flowcell and 
each sample, the FASTQ files were aggregated into single files for read 1 and read 2 
(if not already done by the sequencing core). During this process, these reads were 
trimmed by 3 on the 5’ end and 5 on the 3’ end. Alignment of reads was performed 
using the subjunc aligner (version 1.5.0-p2)°. BAM files obtained from subjunc 
were used as inputs into featureCounts (version 1.5.0-p2) and gene-level read 
counts were produced. For a reference genome, the GRCh37 build provided by 
the Broad as part of the GATK bundle was used. Gene assignments were based on 
the Ensembl build 75 gene models on GRCh37. The following parameters were 
used for the software: 

subjunc -i /path/to/reference/ -u -r fastq1 -R fastq2 -o outputBAMFilename -I 
5 -T 7 -d 50 -D 600 -S fr and featureCounts -a Homo_sapiens.GRCh37.75.gtf -o 
output -F GTF -t exon -g gene_id -s 2 -C -T 10 -p -P -d 50 -D 600 -B BAM_files. 

The data were collated from featureCounts matrices and all genes with no 
counts across the samples were excluded. Genes with duplicate gene symbols and 
those for wich the counts were <10 for 90% or more of the samples were addition- 
ally removed before normalization similar to the approach suggested for weighted 
gene correlation network analysis (WGCNA)®. Samples for which the median 
expression was less than 2 standard deviations below the average were removed 
from the dataset (n = 10). Normalization was performed using the conditional 
quantile normalization procedure™, which produced GC-content-corrected logy 
reads per kilobase per million mapped reads (RPKM) values. This procedure 
produces both offsets to be used in conjunction with edgeR as well as a matrix of 
log»-normalized RPKM values for clustering. 

In addition, the subjunc BAM files were processed using the RNA-sequencing 
genotyping protocol (as of GATK version 3.3), which was similar to the whole- 
exome sequencing protocol described in the ‘Whole-exome sequencing data pro- 
cessing’ section, including the following steps for each sample: (1) MarkDuplicates; 
(2) SplitNCigarReads; (3) RealignerTargetCreator/IndelRealigner; (4) base quality 
score recalibration. The resulting BAM files were used to produce RNA geno- 
types using the UnifiedGenotyper for the purposes of quality control and ethnicity 
estimation. 

Gene fusion data were additionally generated using the TopHat-Fusion (version 
2.0.14) program using default parameters®. 

Coexpression network formation. We formed coexpression modules using the 
WGCNA procedure on the RNA-sequencing data from the ‘RNA sequencing and 
data processing’ section. All RNA-sequencing samples were used to form the set 
of modules. Owing to the heterogeneity of the clinical expression data, we gener- 
ated ‘signed hybrid’ networks using the ‘bicor’ correlation®, setting the maximum 
proportion of outliers to 0.1. We ran the procedure multiple times, varying several 
parameters to choose the most relevant set for further analysis. The WGCNA 
procedure was run on datasets formed from the top 2,000 and 5,000 most variable 
genes. For each dataset, we set the ‘power’ variable to either 2 or 3. For each of these 
runs, we varied the module detection parameters of dynamicTreeCut® , namely 
the deepSplit parameter was set to 0 or 2 and the pamStage parameter was set to 
TRUE or FALSE. For each of these sets of modules we computed a series of module 
quality statistics®’, mean correlation, mean adjacency, mean maximum adjacency 
ratio (MAR), mean correlation with the module eigengene (KME), proportion of 
variability explained and the mean cluster coefficient. Significance of modules was 
determined by computing a z score of each of these values relative to the mean and 
standard deviation of those from 100 random assignment of modules. We chose 
the set of modules to use in our analyses as those that were most correlated with 
the ‘specimentSpecificDx using module quality as a tie-breaker. The analysis set of 
modules was chosen to be the version using the 5,000 most variable genes, power 
set to 2 and modules formed using deepSplit = 2 and pamStage = FE. Of this set 
of modules, only the grey module did not have a summary Z statistic (median 
across the four density measures) of at least 2. In addition, after correcting the data 
using the estimated principal components®, the module structure did not change 
appreciably (data not shown). 

Quality control. The UnifiedGenotyper runs for both the whole-exome sequenc- 
ing and RNA-sequencing data were combined into a single VCF file using the 
GATK CombineVCFs functionality. This combined VCF file was converted to a 
GDS file using SNPRelate (version 1.12.2). Note, the version is the most recent 
version as several versions were used across the entire project. The overall similarity 
of the genotypes of each pair of samples was computed, termed identity by state 
(IBS) and hierarchical clustering was performed using one minus this similarity. 
From this clustering and visualization we had devised hard cut-offs for further 
inspection based on the types of data being compared. For instance, samples 
not meeting the specified IBS thresholds (DNA-DNA = 0.9; RNA-RNA = 0.83; 
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DNA-RNA = 0.89) were subject to manual review. On the basis of the dendro- 
gram structure as well as the clinical/laboratory information, samples were either 
excluded, assigned to a different patient ID or in rare cases assigned to a different 
sample. It was observed that bone marrow transplants between sample collections 
produced a noticeable but milder effect in these dendrograms and such samples 
were flagged for removal from the RNA-sequencing analysis and for treatment 
as tumour-only samples in the whole-exome sequencing analysis as described in 
‘Whole-exome sequencing variant detection. 

Fusion annotation for analysis. Fusions calls were determined from a consensus 
of three datatypes, a specific diagnosis categorization at the time of sample acqui- 
sition, current set of clinical karyotypes and fusions detected in RNA-sequencing 
data by TopHat-Fusion. All sources were limited to the same set of known fusions: 
RUNX1-RUNX1T1, CBFB-MYH11, MLLT3-KMT2A, DEK-NUP214, GATA2- 
MECOM and PML-RARA. It was determined that the RNA-sequencing calls did 
not provide additional resolution in detecting these known fusions and was not 
performed on all the samples, so the consensus was limited to the clinical karyotype 
calls as well as the specific diagnosis categorization (which was determined based 
on karyotype and other cytogenetic clinical tests). Overall, the calls were based on 
the karyotype data except in 10 cases; for 3 cases the karyotype and diagnosis was 
sufficiently complex to warrant a separate ‘complex’ categorization. The remaining 
7 of these cases were set to the specific diagnosis classification. It should be noted 
that there was additional support from the RNA-sequencing data for several of 
these cases. 

Ethnicity. The combined RNA and whole-exome sequencing VCF files from the 
‘Quality contro? step were merged with a set of Hapmap genotypes” lifted over to 
build 37. The SNPRelate package was used to convert the VCF to GDS, perform 
linkage disequilibrium (LD) pruning using an LD threshold of 0.2, MAF cut-off of 
0.05 and allowing a missing rate of 0.3 and calculation of the principal components. 
Previously published methodology’! was used to assign admixture proportions 
relative to the HapMap samples using the principal components. Each sample was 
assigned to an ethnicity group based on the group with the maximum admixture 
proportion. If the maximum was 50% or less, we labelled it Admixed. As we had 
observed previously that the clustering of ethnicities for RNA-sequencing samples 
are more diffuse than exome sequencing, we assigned the final inferred ethnicity to 
each patient based on the distinct whole-exome sequencing calls if available, defer- 
ring to RNA sequencing only if not available. If multiple exome sequencing samples 
were present with discrepant calls, the data of the patient were manually reviewed. 
The only patient for which this occurred was patient 4043, a self-identified 
Hispanic who had two RNA samples and an exome sequencing sample inferred as 
‘White’ and one exome sample inferred to be ‘HispNative. Only the ‘White’ call for 
the exome sequencing had an admixture proportion over 0.5. The patient was kept 
consistent with the self-identification and labelled as ‘HispNative. 

Sex. For DNA, coverage was first computed over the Y chromosome and the counts 
for each sample were added up and log-transformed (after adding 1 to all the 
counts). k-means clustering was used to assign samples to two clusters with the 
cluster with the lower mean labelled as the ‘Female’ cluster. 

For RNA sequencing, counts were converted to counts per million after 

applying the Trimmed Mean of M scaling normalization”. A set of 28 genes 
were chosen to successfully discriminate the genders using DE analysis over 
multiple studies (data not shown) and were used in conjunction with k-means 
clustering to form two clusters. The ‘Female’ cluster was labelled based on high 
XIST expression. 
ELN 2017 classification. This procedure is based on the categorization in table 5 
of the 2017 ELN update paper”. Karyotypes in the clinical file were first cleaned 
and parsed into clones or subclones and distinct abnormalities using standard 
conventions’*, The current representation was corrected for nomenclature type 
(for example, ‘ideny versus ‘s!’) in a basic manner. For instance, ambiguous events, 
such as chromosomal loss (for example, —15), were not corrected for whether 
the preceding clone had a counteracting gain. Also, additional ‘+’ or “—’ symbols 
in conjunction with valid karyotype operators in a separate clone (for example, 
+del(12)(q?15) or —del(12)(q?15))) were treated separately with gains (++) being 
kept in the unique count of events and losses (—) being removed. 

Abnormalities were first checked for the following categories: (1) RUNX1- 
RUNX1T1; (2) CBFB-MYH11; (3) MLLT3-KMT2A; (4) DEK-NUP214; 
(5) KMT2A-*; (6) BCR-ABL1; (7) GATA2-MECOM; (8) —5/del(5q); (9) —7; 
(10) —17. Categories 1-8 were further considered to be WHO recurrent fusions. 

The number of unique abnormalities (across clones) was then computed. 
Whether or not a karyotype was considered to be polyploid was also recorded 
(at least 60 chromosomes or ‘(> 3)n’ or label). NPM1, FLT3-ITD and biallelic 
CEBPA were derived from consensus calls. FLT3-ITD allelic ratios were deter- 
mined only for the samples with an internal assay. The MAF values of the inter- 
nal assay were converted to a ratio using the formula MAF/(1 — MAF). RUNX1, 
ASXL1 and TP53 were derived from the clinical genotypes. Abnormal 17 calls were 
manually curated from the karyotype data and clinical genotype calls. 


The determination of ELN 2017 categories proceeds by assigning true/false/not 
available values to one or more of the five columns (three ELN and two ambiguous) 
in the following manner. (1) ‘isFavourable is considered true if a sample has at least 
one of the following: (a) RUNX1-RUNX1T1; (b) CBFB-MYH11; (c) positive NPM1 
and negative FLT3-ITD; (d) positive NPM1 and positive FLT3-ITD with allelic 
ratio <0.5; (e) biallelic CEBPA. (2) ‘isFavourableOrIntermediate’ when NPM1 is 
positive and FLT3-ITD is positive but the allelic ratio is not available. (3) ‘isAdverse 
is considered true if a sample has at least one of the following: (a) DEK-NUP214; (b) 
KMT2A-*; (c) BCR-ABL1; (d) GATA2-MECOM; (e) —5/del(5q); (f) —7; (g) —17; 
(h) abn_17; (i) three or more abnormalities and no WHO recurrent fusions; (j) one 
monosomy (autosomal) and at least one additional abnormality except for CBFB- 
MYHI11; (k) positive RUNX1 or ASXL1 and not considered to be ‘isFavourable’ or 
‘isFavourableOrIntermediate’; (1) positive TP53; (4) ‘isIntermediate’ is considered 
true if a sample has at least one of the following: (a) MLLT3-KMT2A; (b) NPM1 is 
positive and FLT3-ITD is positive with allelic ratio >0.5; (c) NPM1 is negative and 
FLT3-ITD is negative or has a low allelic ratio (<0.5); (d) at least one abnormality 
and is not considered ‘isFavourable or ‘isAdverse’; (5) ‘isIntermediateOrAdverse’ 
when NPM1 is negative and FLT3-ITD is positive without an allelic ratio. Calls 
were annotated as ‘not available’ in the absence of FLT3-ITD or NPM1 calls. 

Samples for which the specific diagnosis at inclusion indicated ‘acute promyelo- 
cytic leukaemia with t(15;17)(q22;q12)’ were automatically set to ‘Favourable. Any 
overlaps in the categories were resolved based on manual expert review. 

Ex vivo functional drug screens. Ex vivo functional drug screens were performed 
on freshly isolated mononuclear cells from AML samples. In brief, 10,000 cells per 
well were arrayed into three, 384-well plates containing 122 small-molecule inhib- 
itors. This panel contained graded concentrations of drugs with activity against 
two-thirds of the tyrosine kinome as well as other non-tyrosine kinase pathways, 
including mitogen-activated protein kinases (MAPKs), the pathway involving 
phosphatidylinositol-4,5-bisphosphate 3-kinase, AKT serine/threonine kinase 1 
and mechanistic target of rapamycin kinase (PIK3C-AKT-MTOR)); protein kinase 
AMP-activated (AMPK, also known as PRKAA1), ATM serine/threonine kinase 
(ATM), Aurora kinases, calcium/calmodulin-dependent protein kinases (CAMKs), 
cyclin-dependent kinases (CDKs), serine/threonine protein kinase 3 (GSK3), 
IkB kinase (IKK), cAMP-dependent protein kinase (PKA), protein kinase C 
(PKC), polo-like kinase 1 (PLK1) and RAF proto-oncogene serine/threonine 
kinase (RAF). In addition, the library contained small-molecule inhibitors with 
activity against the BCL2 family, bromodomain containing 4 (BRD4), Hedgehog, 
heat shock protein 90 (HSP90), NOTCH/*-secretase, proteasome, survivin, signal 
transducer and activator of transcription 3 (STAT3), histone deacetylase (HDAC), 
and WNT/@-catenin. Drug plates were created using inhibitors purchased from 
LC Laboratories and Selleck Chemicals and master stocks were reconstituted in 
dimethyl] sulfoxide (DMSO) and stored at —80 °C. Master plates were created by 
distributing a single agent per well in a seven-point concentration series, created 
from threefold dilutions of the most concentrated stock resulting in a range of 
10 {tM to 0.0137 \1M for each drug (except dasatinib, ponatinib, sunitinib and 
YM-155, which were plated at a concentration range of 1 \tM to 0.00137 ,M). 
DMSO-control wells and positive-control wells containing a drug combination of 
flavopiridol, staurosporine and velcade were placed on each plate, with the final 
concentration of DMSO <0.1% in all wells. Daughter plates were created using 
a V&P Scientific 384-well pin tool head operated by the Caliper Sciclone ALH 
3000 and equipped with 0.457-mm diameter, 30-nl, slotted stainless-steel pins 
(FP1NS30). Daughter and destination plates were sealed with pealable thermal 
seals using a PlateLoc thermal sealer. Destination plates were stored at —20 °C for 
no more than three months and thawed immediately before use. Primary mononu- 
clear cells were plated across single-agent inhibitor panels within 24 h of collection. 
Cells were seeded into 384-well assay plates at 10,000 cells per well in Roswell 
Park Memorial Institute (RPMI) 1640 medium supplemented with fetal bovine 
serum (FBS) (10%), L-glutamine, penicillin-streptomycin, and 3-mercaptoethanol 
(10-4 M). After three days of culture at 37 °C in 5% CO», MTS reagent (CellTiter96 
AQueous One; Promega) was added, the optical density was measured at 490 nm, 
and raw absorbance values were adjusted to a reference blank value and then used 
to determine cell viability (normalized to untreated control wells). 

Ex vivo functional drug screen data processing. A workflow in which the data 
normalization, curve fit parameters and quality assurance/quality control steps are 
summarized can be found on our online browser (http://vizome.org/additional_ 
figures_BeatAML.html) with processed drug response data for each specimen 
listed in Supplementary Table 10. A given sample was run on one or more panels 
and within each panel, the majority of drugs were run without within-panel rep- 
licates. Two steps were performed to harmonize these data before model fitting. 

First, a ‘curve-free’ AUC (integration based on fine linear interpolation between 
the seven data points themselves) was calculated for those runs with within-panel 
replicates after applying a ceiling of 100 and a floor of 0 for the normalized viability. 
The maximum change in AUC among the replicates was noted and those runs with 
differences >100 were removed. 
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Second, the remaining within-plate replicates had their normalized viability 
averaged subject to a ceiling of 100 and floor of 0. An additional set of ‘curve- 
free’ AUCs was computed for sample-inhibitor pairs run on multiple panels. The 
maximum change in AUC among the across-panel replicates was noted and those 
runs with differences >75 were removed. 

At this point, the within- and across-plate replicates for the normalized viability 
were averaged together and a ceiling of 100 was applied. From the steps above, 
the floor was already at 0. On the basis of the methodology used in our prior 
drug-combination study”, a probit regression was fit to all possible run groups 
using the model: (normalized viability / 100) ~ 1 + logjo(concentration). For all 
groups there were n = 7 dose-response measurements. 

The summary analyses of curve fit were inspected and cut-offs were devised 
removing all runs with an Akaike information criteria (AIC) > 12 and deviance 
>2. For inhibitors that were run using multiple concentration ranges, only the 
most-recent concentration range was kept. Finally, these data were compared to the 
AUC values from third-order polynomial fits. Those runs that were discrepant in 
terms of sensitive or resistant calls were manually reviewed as subject to removal. 
Ex vivo functional drug screen analysis. For all drug analyses that required a call 
of sensitivity or resistance (for example, the gene expression signatures), sensitivity 
or resistance was determined by the lowest and highest 20% of the AUC values 
for each drug. 

Correlations between drugs in families. For each inhibitor in the study, available 
data on targets of the inhibitors were pulled from a variety of online resources and 
published studies, many of which were aggregated in the Cancer-Targetome’™”®. 
Activity of each inhibitor for targets was then distilled into a five-tier system to 
afford comparison across drugs with differing degrees of potency and/or for which 
differing assays were used to measure drug and/or target activity. Well-represented 
genes, gene families and pathways were then filtered for drugs that have activity 
in the top 3 tiers for one or more member of the gene family or pathway. These 
lists were then manually curated to generate the final list of high-confidence drug 
target families (shown in Supplementary Table 11). For each inhibitor assigned to 
at least one drug target family, the Pearson's correlation was computed against all 
other drugs assigned to at least one drug target family for the AUC values of all 
available samples shared between the two drugs. 

Correlations between drugs and samples. Drugs were first filtered to require greater 
than two hundred samples per drug. Additional samples were removed accordingly 
to allow correlations to be computed between all present samples using available 
AUC data and between all drugs. 

Summary drug response scores. For each patient sample, a binary score (1/0) was 
used for each drug based on the same threshold as for the gene signatures (for 
example, sensitivity or resistance was determined by the lowest and highest 20% 
of the AUC values for each drug). Individual scores were computed for resistance 
and sensitivity separately and presented as the proportion over all screened drugs 
for each patient sample. 

Expression analysis and integration with ex vivo functional drug screen. For all 
the below analyses the earliest sample was chosen for each patient. 

Expression heat map. The top 2,000 most variable genes were extracted. The 
expression values were centred and scaled across patients and complete-linkage 
hierarchical clustering was performed using the ComplexHeat map R package”’. 

Sensitive or resistant differential expression. For each drug, it was required that 
at least three sensitive and three resistant samples using the 20%/20% criteria 
outlined in the ‘Ex vivo functional drug screen analysis’ section. Patient samples 
were limited to those labelled as sensitive or resistant. Next, genes were limited 
based on their expression, for which at least half the patients used for analysis 
had to have greater than one count per million (an approach suggested in the 
limma users manual)’. The normalized expression as in the data described in 
the ‘RNA sequencing and data processing’ section with the chosen samples and 
genes was used for differential expression analysis. Because the data had not been 
batch-corrected at this point, surrogate variable analysis (SVA)” was used to infer 
covariates for correcting out technical confounders. Next, the linear model fitting 
for each gene was performed using the limma-trend approach, testing whether 
the average expression was different between resistant and sensitive correcting for 
the SVA covariates. Genes with Benjamini-Hochberg** FDR values of less than 
0.05 were kept for the cluster analysis. The expression matrix was corrected with 
respect to the estimated surrogate variables for consistency with the differential 
expression procedure using fSVA*! and MCLUST® was used to determine optimal 
number of clusters and parameterization. The results were then visualized using 
a CLUSPLOT®, which displays the clustering results with respect to the first two 
principal components of the gene expression for the kept genes. 

Mutation analysis and integration with ex vivo functional drug screen. For all 
the below analyses in which groups of samples were compared, the earliest sample 
was chosen for each patient. 

TCGA comparison. The lifted-over TCGA variants from the “‘Whole-exome 
sequencing variant detection section were annotated using the VEP from Ensembl 
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build 83, filtered for protein-altering and splice site variants and our ‘blacklist’ was 
applied to ensure the variant sets were comparable. 

Co-occurrence and mutual exclusivity. Only mutations seen in at least 10 patients 
were kept. The DISCOVER" method was used to determine significant mutual 
exclusivity and co-occurrence. A plot of the co-occurrences was generated using 
corrplot* with the odds ratio of the pairwise co-occurrence used to colour and 
scale the circle sizes. 

Association between mutations and drugs. For each mutated gene in the exome 
sequencing samples and each recurrent fusion (counting FLT3-ITD as a distinct 
entity from other FLT3 mutants), we determined all available (at least 5 patients) 
pairwise and three-way co-occurrence sets. For each drug and each valid set of 
genes (from one to three genes), we fitted a linear model with AUC as the response 
and examined the linear contrast (that is, two-sided Student's t-test) comparing the 
AUC of the gene(s) to the appropriate negative. For example the average AUC of 
the FLT3, DNMT3A and NPM1 mutants would be compared to average AUC of 
the samples negative for all three genes. FDR was computed using the Benjamini- 
Hochberg method over all the drugs. 

For the ibrutinib and entospletinib comparisons, the presence and absence of 
the three genes or mutations: NPM1, FLT3-ITD and DNMT3A was collapsed into 
levels of a single factor. The corresponding single-factor ANOVA was carried out 
with the ‘triple-negative’ category set as the reference. Significance of the P values 
of each coefficient was compared to the Bonferroni-corrected 0.05 level. 

For the JAK-family analysis, the AUC values were pooled for the four JAK 
inhibitors (CYT387, tofacitinib (CP-690550), JAK inhibitor I, ruxolitinib 
(INCB018424)) for each gene mutation set (BCOR, BCOR and DNMT3A, BCOR 
and RUNX1, BCOR and SRSF2). The contrast of the difference between BCOR and 
RUNX1 samples and the average of the other three mutation groups was tested. 
Integration of both mutation and RNA sequencing with ex vivo functional drug 
screen. Mutations (0/1 score) and the module Eigengenes from the WGCNA analysis 
were used separately and combined together in regression models with coefficients 
selected using the LASSO approach” as implemented in glmnet*®. For each data type 
and the combination, only drugs with at least 200 patients samples were tested. The 
three datasets were initially randomly separated into training (75%) and test (25%) 
sets. Similar to a previous approach*, a bootstrap aggregation approach was used 
in which the 1,000 bootstraps of the training dataset were generated and for each 
one, the LASSO was trained using tenfold cross-validation. Predictions were formed 
for the test dataset over these bootstrap models and the predicted AUC was aver- 
aged. R? values were computed for these aggregated predictions relative to the test 
AUC values. As performance was seen to be dependent on the initial split between 
test and training, we repeated the entire process 100 times, recording the mean and 
standard deviation of the R? value as well as the count non-zero coefficients 
Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability 

All raw and processed sequencing data, along with relevant clinical annotations, 
have been submitted to dbGaP and Genomic Data Commons. The dbGaP study ID 
is 30641 and accession ID is phs001657.v1.p1. The raw data for clinical annotations, 
variant calls, gene expression counts and drug sensitivity that underlie Figs. 1-3 
and Extended Data Figs. 1-9 are provided as Source Data. In addition, all data can 
be accessed and queried through our online, interactive user interface, Vizome, at 
http://www.vizome.org/. 
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Extended Data Fig. 1 | Genomic landscape of the Beat AML cohort. 

In total, 622 specimens from 531 patients were used for whole-exome 
sequencing. Automated and manual curation steps (described in 

the Methods, Supplementary Information and at http://vizome.org/ 
additional_figures_BeatAML.html) were used to obtain a final set of high- 
confidence variants (annotated in Supplementary Table 7) and the earliest 
sample for each individual patient was used in this analysis. Clinical 
cytogenetics and gene fusion calls from RNA sequencing were used to 
curate recurrent gene rearrangements (Supplementary Information). 

The mutational profile for each patient is shown for frequency-ranked 
mutational events (top) and frequency-ranked gene rearrangements 
(bottom). The mosaic plot is annotated with clinical features of each case, 
such as diagnosis or relapse and de novo or transformed disease states, 
and the first bar chart on the right summarizes the cohort frequencies of 
mutational and gene rearrangement events. The last bar chart on the right 
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summarizes the frequency of significant drug-mutation associations for 
the given gene across the cohort with drug sensitivity displayed in red and 
drug resistance displayed in blue. Eleven genes that have not previously 
been reported to be somatically mutated in cancer were observed with 
mutations at approximately 1% cohort frequency: CUB and Sushi 
multiple domains 2 (CSMD2), NAC alpha domain containing (NACAD), 
teneurin transmembrane protein 2 (TENM2), aggrecan (ACAN), ADAM 
metallopeptidase with thrombospondin type 1 motif 7 (ADAMTS7), 
immunoglobulin-like and fibronectin type II domain containing 1 
(IGFN1), neurobeachin-like 2 (NBEAL2), poly(U) binding splicing factor 
60 (PUF60), zinc-finger protein 687 (ZNF687), cadherin EGF LAG seven- 
pass G-type receptor 2 (CELSR2) and glutamate ionotropic receptor 
NMDA type subunit 2B (GRIN2B). For the number of samples used to 
correlate each drug with mutations, see Supplementary Table 17. 
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Extended Data Fig. 6 | See next page for caption. 


Extended Data Fig. 6 | Binary drug response calls and correlation with 
clinical subsets. a, For the intersect of every specimen with evaluable 
response data for each inhibitor, we created a threshold for binary 
sensitive or resistant calls based on whether the individual specimen 
response fell within the most sensitive 20% of all specimens tested against 
that drug. A matrix plot showing the unsupervised clustering of the binary 
calls can be found at http://vizome.org/additional_figures_BeatAML.html. 
The binary drug-resistance calls for each specimen were combined into a 
single value, representing the proportion of drugs to which an individual 
specimen was sensitive (left) or resistant (right). Specimens were divided 
into ‘Favourable’ and ‘Adverse’ groups based on ELN 2017 criteria to 


ARTICLE 


determine whether overall drug sensitivity or resistance correlated with 
prognostic features of disease (n = 233 patients). b, The binary drug- 
resistance calls for each specimen as in a. Specimens were divided into 
diagnostic groups based on WHO 2016 categories to determine whether 
overall drug sensitivity or resistance correlated with cytogenetic or 
morphologic features of disease (n = 340 patients). a, b, The top and 
bottom points of the box plots show 1.5 times the interquartile range 
(IQR) from the upper and lower lines; the top, middle and bottom lines 
indicate the 75th, median and 25th percentile of the data with the notches 
extending 1.58 x IQR/ (V(n)). Specific sample sizes of each group are 
reported in Supplementary Table 22. 
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Extended Data Fig. 7 | Integration of genetic events with drug 
sensitivity. a, Circos plot showing AML rearrangements in the centre, 
mutational events in the next concentric ring, and gene expression events 
in the outer ring. The size and width indicates frequency of the event 

and the FDR-corrected Q value of association with drug sensitivity is 
colour-coded (sensitivity (red); resistance (blue)). For each gene, tests 
involving expression were two-sided Student's t-tests (linear model) of the 
differences between sensitive and resistant samples. For mutational events, 
the average difference in AUC between mutant and wild-type samples 
was determined using two-sided Student’s t-tests from a linear model as 
shown in Fig. 2a. FDR was computed using the Benjamini-Hochberg 
method over all the drugs. The number of samples used to correlate each 
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mutational event with drug sensitivity is reported in Supplementary 
Table 17. b, As in Fig. 2a, the average difference in AUC drug response 
between mutant and wild-type cases was determined using a two-sided 
Student's t-test from a linear model fit (plotted on the x axis and the FDR- 
corrected Q value is plotted on the y axis). This analysis shows only FLT3- 
ITD-negative cases. FDR was computed using the Benjamini-Hochberg 
method over all the drugs. The number of samples used to correlate each 
mutational event with drug sensitivity is reported in Supplementary 
Table 17. Expanded and interactive plots are available in our online data 
browser (http://www.vizome.org/ and http://vizome.org/additional_ 
figures_BeatAML.html). 
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Extended Data Fig. 8 | Integration of drug sensitivity with genetic over all the drugs. The degree of significance is represented on the y 
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average difference in AUC drug response between mutant and wild-type to correlate each mutational event with drug sensitivity is reported in 
cases was determined using a two-sided Student's t-test from a linear Supplementary Table 17. 


model fit. FDR was computed using the Benjamini-Hochberg method 
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line). The variables appear in descending order clockwise starting at 
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Pathogen elimination by probiotic 
Bacillus via signalling interference 


Pipat Piewngam!”, Yue Zheng!, Thuan H. Nguyen! , Seth W. Dickey!, Hwang-Soo Joo!*, Amer E. Villaruz!, Kyle A. Glose!, 
Emilie L. Fisher!, Rachelle L. Hunt', Barry Li!, Janice Chiou', Sujiraphong Pharkjaksu?, Sunisa Khongthong?, 


Gordon Y. C. Cheung", Pattarachai Kiratisin? & Michael Otto!* 


Probiotic nutrition is frequently claimed to improve human health. In particular, live probiotic bacteria obtained with 
food are thought to reduce intestinal colonization by pathogens, and thus to reduce susceptibility to infection. However, 
the mechanisms that underlie these effects remain poorly understood. Here we report that the consumption of probiotic 
Bacillus bacteria comprehensively abolished colonization by the dangerous pathogen Staphylococcus aureus in a rural Thai 
population. We show that a widespread class of Bacillus lipopeptides, the fengycins, eliminates S. aureus by inhibiting 
S. aureus quorum sensing—a process through which bacteria respond to their population density by altering gene regulation. 
Our study presents a detailed molecular mechanism that underlines the importance of probiotic nutrition in reducing 
infectious disease. We also provide evidence that supports the biological significance of probiotic bacterial interference 
in humans, and show that such interference can be achieved by blocking a pathogen’s signalling system. Furthermore, 
our findings suggest a probiotic-based method for S. aureus decolonization and new ways to fight S. aureus infections. 


There is increasing appreciation of the key role that the intestinal 
microbiota play in preventing the colonization and overgrowth of 
pathogens”. The mechanisms that have been implicated in this ben- 
eficial function of probiotic bacteria are mostly indirect, and include 
modulation of the immune system, enhancement of the intestinal epi- 
thelial barrier, or competition with pathogens for nutrients”>. Whether 
there is direct interference between probiotic and pathogenic bacte- 
ria is less clear. Some probiotic strains produce bacteriocin proteins, 
which can kill phylogenetically related pathogenic bacteria’, and it has 
been shown that a bacteriocin-producing Escherichia coli strain inhib- 
its colonization by related pathogenic bacteria in the inflamed gut of 
mice®. However, no evidence has been obtained to indicate that such 
mechanisms matter or are widespread in humans. Furthermore, it is 
not known whether there are mechanisms for direct probiotic bacterial 
interference that are not mediated by bacteriocins. 

The genus Bacillus comprises different species of soil bacteria that 
form endospores with the ability to survive harsh environmental condi- 
tions, such as the high temperatures encountered during cooking pro- 
cedures. Bacillus spores are commonly ingested with vegetables’. They 
can subsequently germinate to form metabolically active, vegetative 
cells’, which can temporarily colonize the intestinal tract?. Given the 
variability in dietary customs, the concentration of Bacillus spores in 
human faeces is also highly variable. It has been reported to be around 
10° colony-forming units (CFU) per gram on average, occasionally 
reaching up to 10° CFU per gram’. Several probiotic formulae contain 
Bacillus species’°, which are thought to reduce pathogen colonization 
by mechanisms that—except for a described immune-stimulatory effect 
on epithelial cells''—remain poorly defined. 

Staphylococcus aureus is a widespread and dangerous human patho- 
gen that can cause a variety of diseases, ranging from moderately severe 
skin infections to fatal pneumonia and sepsis'*. Treatment of S. aureus 
infections is severely complicated by antibiotic resistance!?, such as 
in methicillin-resistant S. aureus (MRSA), and there is no working 


S. aureus vaccine", Therefore, alternative strategies to combat S. aureus 
infections are eagerly sought’>. Because S. aureus infections commonly 
originate from previous asymptomatic colonization!®”’, decolonization 
has recently gained considerable attention as a possible means to fight 
S. aureus infections in a preventive manner!®, While the nares (nostrils) 
have traditionally been considered the primary S. aureus colonization 
site!’, there is increasing evidence that the intestinal tract is also com- 
monly colonized by S. aureus**-*? and forms an important reservoir 
for outbreaks of infectious S. aureus disease™*4. Several studies have 
reported levels of S. aureus in the faeces of human adults of around 
10°-10* CFU per gram?>*”, Possibly, intestinal S. aureus colonization 
explains the failure of previous topical decolonization efforts aimed 
solely at the nose!®?8, 

Here we hypothesized that the composition of the human gut micro- 
biota affects intestinal colonization with S. aureus. To evaluate that 
hypothesis, we collected faecal samples from 200 healthy individuals 
from rural populations in Thailand (Fig. 1a). This exemplary population 
was selected in order to rule out, as much as possible, the food steriliza- 
tion and antibiotic usage that are common in highly developed urban 
areas, which potentially could diminish the abundance of probiotic bac- 
teria in the food and intestinal tracts of the participating subjects. Our 
analysis revealed a comprehensive Bacillus-mediated S. aureus exclusion 
effect in the human population. By demonstrating that quorum sensing 
is indispensable for S. aureus to colonize the intestine, and discovering 
that secreted Bacillus fengycin lipopeptides function as quorum-sensing 
blockers to achieve complete eradication of intestinal S. aureus, we 
provide evidence that strongly suggests that this pathogen-exclusion 
effect in humans is due to a widespread and efficient probiotic-mediated 
mechanism that inhibits pathogen quorum-sensing signalling. 


S. aureus exclusion by Bacillus 
We found that 25/200 (12.5%) of human subjects carried S. aureus in 
their intestines, as determined by growth from faecal samples. Nasal 
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Fig. 1 | Exclusion of S. aureus colonization by dietary Bacillus ina 
human population. a, Areas (in red) from which faecal samples were 
collected in rural populations and analysed for the presence of Bacillus 


carriage was similar in frequency (26/200; 13%), a result that is in 
accordance with previous findings showing a correlation between nasal 
and intestinal colonization”. These rates are considerably lower than 
those commonly found in adult populations during cross-sectional 
culture-based surveys that were performed mainly in hospital-admitted 
individuals in urbanized areas (on average, 20% for intestinal and 40% 
for nasal carriage)!©?!?, 

To examine the hypothesis that bacterial interactions in the gut 
determine intestinal S. aureus colonization, we first analysed the com- 
position of the gut microbiome by 16S ribosomal RNA sequencing. 
However, we did not detect substantial differences in the composi- 
tion of the microbiome between S. aureus carriers and non-carriers 
(Extended Data Fig. 1). 

By contrast, we found a striking correlation between the presence of 
Bacillus bacteria and the absence of S. aureus. Bacillus species (mostly 
B. subtilis; Extended Data Table 1) were found in 101/200 (50.5%) of 
subject samples. S. aureus was never detected in faecal samples when 
Bacillus species were present (P< 0.0001, Fisher’s exact test; Fig. 1b). 
Furthermore, this pathogen-exclusion effect was not limited to the site 
of interaction—the gut—but extended to S. aureus colonization in a 
general fashion. While Bacillus was generally absent from nasal sam- 
ples, S. aureus nasal colonization was never detected when intestinal 
Bacillus was present (P< 0.0001, Fisher’s exact test; Fig. 1c). Notably, 
the levels of S. aureus colonization that we found in non-Bacillus- 
colonized individuals from rural Thailand approximately match 
those reported—using similar culture-based assays—in urbanized 
Western areas. These findings indicate a widespread mechanism 
exerted by Bacillus species that comprehensively inhibits coloniza- 
tion with S. aureus. Moreover, they suggest that S. aureus colonization 
is increased in urban populations because of the lack of a probiotic, 
Bacillus-containing diet. Of particular note, the results also indi- 
cate that the intestinal site has a previously underappreciated role in 
determining general S. aureus colonization, a notion in accordance 
with findings attributing a key role to faecal transmission in MRSA 
recolonization”®, 

When we analysed data from previous 16S rRNA-sequencing-based 
microbiome studies, we found strongly variant results and no 
correlation between the absence of S. aureus and the presence 
of B. subtilis: studies that reported considerable B. subtilis or S. aureus 
numbers (samples with more than 10% colonization by either species) 
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did not reveal exclusion phenomena (average 14.89 + 15.69% coloni- 
zation by both species) (Extended Data Table 2). However, although 
we did not find a correlation, this might be due to the fact that 
such sequencing-based analyses are set up to detect high-order taxo- 
nomic shifts rather than specific differences on the species or genus 
level. 


Quorum sensing and colonization 

Our results, which show no substantial high-order taxonomic differ- 
ences in the microbiome composition between S. aureus carriers and 
non-carriers, exclude an indirect effect of Bacillus on the microbiome 
composition. Rather, we hypothesized that the Bacillus isolates produce 
a substance that directly and specifically inhibits intestinal colonization 
by S. aureus. We first analysed whether there is a growth-inhibitory 
effect of the Bacillus isolates on S. aureus. However, only a minor 
growth inhibition occurred in just 6 out of 105 isolates (we saw a 
maximal 1-mm inhibition zone when using an agar diffusion test with a 
five-times-concentrated culture filtrate). Therefore, a growth-inhibitory 
effect fails to explain the observed complete correlation between 
the presence of Bacillus and the absence of S. aureus, and rules out a 
bacteriocin-mediated phenomenon. 

The factors that are important for S. aureus intestinal colonization 
are poorly understood. One study in mice has implicated teichoic 
acids found in the bacterial cell wall, as well as the cell-surface pro- 
tein clumping factor A (CIfA)*’. Prompted by our previous finding 
that CIfA is positively regulated by the accessory gene regulator (Agr) 
quorum-sensing system*°, we hypothesized that the Bacillus isolates 
secrete a substance that interferes with quorum-sensing signalling. 
Quorum sensing is responsible for sensing the density of the bacterial 
population (the ‘quorum) and controlling a concomitant alteration in 
cell physiology*'. Because quorum-sensing signals and sensors differ 
between different types of bacteria*!, an underlying quorum-quenching 
mechanism could explain the specificity of the inhibitory effect that 
we detected. 

Because the role of quorum sensing in S. aureus intestinal coloni- 
zation is unknown, we first used a mouse model of S. aureus intes- 
tinal colonization to test whether Agr-based quorum sensing is 
involved (Fig. 2a). In all mouse models in our study, we included: first, 
a human faecal isolate belonging to a sequence type (ST) that was 
frequently detected in the faecal isolates that we obtained (ST2196), 
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Fig. 2 | Quorum-sensing dependence of S. aureus intestinal 
colonization. a, Experimental set-up of the mouse intestinal colonization 
model. Mice received, by oral gavage, either 100 j1l containing 

108 CFU ml"! of wild-type (WT) S. aureus strain ST2196 F12 and another 
100 ul of 10° CFU ml’ of the corresponding isogenic agr mutant (n =5 
per group; competitive experiment, shown in b); or 200 yl containing 

10° CFU ml"! wild-type, isogenic agr mutant or Agr (RNAIII)- 
complemented agr mutant (n =5 per group; non-competitive experiment, 
shown in c). CFU in the faeces were determined two, four and six days 
after infection. At the end of the experiment (day seven), CFU in the 
small and large intestines were determined. b, Competitive experiment. 
Total obtained CFU are shown as dot plots; also shown are mean + s.d. 
Bars show the percentage of wild-type among total determined CFU, of 
which 100 were analysed for tetracycline resistance (which is present 

only in the agr mutant). No agr mutants were detected in any experiment; 


according to multi-locus sequence typing (MLST) that we performed 
(Supplementary Table 1); second, a mouse infection isolate (ST88)*?; 
and third, a human infection isolate of the highly virulent MRSA type 
USA300°?. In competition experiments with equal amounts of wild- 
type and isogenic agr mutant strains, only wild-type S. aureus was 
detected in the faeces and colonized the large and small intestines at 
the end of the experiment (competition index > 100) (Fig. 2b and 
Extended Data Fig. 2a, b). Furthermore, in a non-competitive experi- 
mental set-up, only those bacteria expressing the intracellular Agr effec- 
tor RNAIII* achieved colonization; agr-negative control strains never 
did (Fig. 2c and Extended Data Fig. 2c). These data show that, in addi- 
tion to its well-known role in infection*”**, the Agr quorum-sensing 
system is absolutely indispensable for intestinal colonization. 


Fengycin quorum quenchers 
Having established that the Agr quorum-sensing regulatory system is 
essential for S. aureus intestinal colonization, we next analysed whether 
culture filtrates of the Bacillus isolates collected from human faeces 
can inhibit Agr. To that end, we used an S. aureus reporter strain, into 
the genome of which we had transferred the luminescence-conferring 
luxABCDE operon under the control of the Agr P3 promoter*™4, which 
controls production of RNAIII. Remarkably, culture filtrates from all 
105 isolates reduced Agr activity in the S. aureus reporter strain by at 
least 80% (Fig. 3a and Extended Data Table 1). No growth effects were 
observed, substantiating that growth inhibition does not underlie the 
inhibitory phenotype. Furthermore, a culture filtrate from a reference 
B. subtilis strain suppressed the production of key Agr-regulated viru- 
lence factors (phenol-soluble modulins, «.-toxin and Panton- Valentine 
leucocidin; Fig. 3b, cand Supplementary Fig. 1). These results indicate 
that the inhibitory effect of the Bacillus isolates on S. aureus coloniza- 
tion is due to a secreted substance that inhibits Agr signalling. 

To characterize the Agr-inhibitory substance(s), we performed 
experiments with culture filtrate of the reference B. subtilis strain. 
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therefore, all bars show 100% wild type. Given that 100 isolates were 
tested, the competitive index of wild type/agr mutant in all cases is > 100. 
c, Non-competitive experiment with genetically complemented strains. 
Wild-type and isogenic agr mutant strains all harboured the pKX,16 
control plasmid; Agr-complemented strains harboured pKXjRNAII and 
constitutively expressed RNAIII, which is the intracellular effector of Agr. 
During the experiment, mice received 200 1g ml~! kanamycin in their 
drinking water to maintain plasmids. Statistical analysis was performed 
using Poisson regression versus values obtained with the agr mutant 
strains. *P < 0.0001. Data are mean +s.d. Note that no bacteria were found 
in the faeces or intestines of any mouse receiving S. aureus Aagr with 
vector control. The corresponding zero values are plotted on the x axis of 
the logarithmic scale. See Extended Data Fig. 2 for corresponding data 
obtained using strains USA300 LAC and ST88 JSNZ. 


We found that the substance in question was thermostable and 
resistant to protease digestion (Extended Data Fig. 3a). In reversed- 
phase high-performance chromatography (RP-HPLC) (Extended 
Data Fig. 3b), substantial Agr-inhibiting activity was associated with 
two peaks, which we analysed by RP-HPLC/electrospray ionization 
mass spectrometry (ESI-MS) (Extended Data Fig. 3c). This analysis, 
together with the elution behaviour and published literature*®, allowed 
us to identify the Agr-inhibiting substances as members of the fengy- 
cin cyclic lipopeptide family. Because fengycins can differ in specific 
amino acids and in the length of the attached fatty acid, which usu- 
ally is B-hydroxylated (8-OH), and because different Bacillus strains 
produce different fengycin species*’, we used further tandem mass 
spectrometric fragmentation analysis (MS/MS) to identify the specific 
fengycins present in the two active peaks (Extended Data Fig. 3d). 
Fengycins in the first peak were identified as B-OH-C17-fengycin A 
and 3-OH-C16-fengycin B. The second peak consisted of one fengycin 
species, 8-OH-C17-fengycin B. According to RP-HPLC/ESI-MS anal- 
ysis, smaller, adjacent peaks also contained fengycin species, which 
we tentatively identified as 83-OH-C17-fengycin A and the dehydrox- 
ylated versions of the identified three major fengycins (Extended 
Data Fig. 3e). For further analyses, we purified higher amounts of 
8-OH-C17-fengycin B to homogeneity from culture filtrate and veri- 
fied the dose-dependent Agr-inhibiting activity of this pure substance 
(Extended Data Fig. 4). 

Using RP-HPLC/ESI-MS analysis, we found fengycin production in 
all isolates, substantiating the general character of the inhibitory inter- 
action (Extended Data Table 1). Although the production pattern of dif- 
ferent fengycins varied between the analysed isolates, in many of them 
8-OH-C17-fengycin B was the most strongly produced type. Notably, 
almost complete inhibition of Agr was detected at a concentration of 
about 1.4,.M total fengycin (Fig. 3d). This corresponds to the median 
concentration of total fengycin (1.5|1M) produced by stationary- 
phase cultures of the Bacillus isolates (Fig. 3e). 


© 2018 Springer Nature Limited. All rights reserved. 


a d 
S 5 a 15) 
~ 4 o TSB(control)) ©& 
s x TSB (control) 
=a > 104 
ZS x 
2 2 
=o +B. subtilis © 5| +B. subtilis culture 
8 4 culture filtrate * x filtrate (1.4 uM 
> ne final fengycin 
x 2 | concentration) 
0246 0 2 4 6 
Time (h) Time (h) 
e 6 
ca “ 
|= —— PVL 5S 4 iad 
fs “ Mean (7.8 uM) 
BE 3 Sz Median (1.5 1M) 
. a-Toxin “68 ae 
os 2 
28 - 
Control +B. subtilis 
(TSB) culture filtrate 1 
f 
oO 
oO 
© 205 ma PSMcet Z 
= PSMo2 x 
gq 155 = PSMo3 > 
x PSMa4 2 
> 104 m= PSMB1 3 
‘G mm PSMB2 S 
2 5 ® 6-toxin 
> Time (h) 


; -e- B. subtilis WT 
Hoi aly -4- B. subtilis AfenA 
Mrate TOM = B. subtilis AsrfA 


Control 


(TSB) 


+ B. subtilis 
culture filtrate 


Fig. 3 | Inhibition of S. aureus quorum sensing by Bacillus fengycin 
lipopeptides. a, Example of an Agr-inhibition experiment. The Bacillus 
isolate was considered inhibitory if luminescence after 4-h growth of 

S. aureus was less than or equal to half that of the control value (red arrow). 
RLU, relative light units; TSB, tryptic soy broth (control conditions). The 
experiment was performed with n= 2 biologically independent samples. The 
lines connect the means. b, Inhibition of expression of Panton-Valentine 
leucocidin (PVL) and a-toxin, using culture filtrate from the B. subtilis 
reference strain. Western blot analysis of n = 3 biologically independent 
samples was performed with filtrates from S. aureus cultures that had been 
grown for 4h. See Supplementary Fig. 1 for the entire blots. c, Inhibition of 
expression of phenol-soluble modulins (PSMs) using culture filtrate from 
the B. subtilis standard strain. PSM expression was determined by RP-HPLC/ 
ESI-MS after 4h of S. aureus growth. d, Test for Agr-inhibitory capacity of 
Bacillus culture filtrate applied at a final concentration that represents the 
median concentration of total fengycin in the tested 106 Bacillus isolates. 
*P < 0.0001 (two-way analysis of variance (ANOVA) with Tukey’s post-test 
versus control). e, Total fengycin concentrations in stationary-phase culture 
filtrates of the 106 Bacillus isolates (see Extended Data Table 1 for details). 

f, Agr-inhibiting activities of B. subtilis wild-type (WT) in comparison 

to AfenA (fengycin-deficient) and AsrfA (surfactin-deficient) strains. 

*P <0.0001 (two-way ANOVA with Tukey’s post-test versus wild type). 
The experiments shown in ¢, d, f were performed with n = 3 biologically 
independent samples. Data are mean + s.d. 


To provide definitive evidence that fengycin production underlies the 
Agr-inhibiting capacity of Bacillus, we produced an isogenic mutant in 
the reference B. subtilis strain of the fenA gene, which is essential for 
fengycin production*®. RP-HPLC/ESI-MS showed a specific absence 
of fengycins in that mutant strain, whereas surfactins—the predomi- 
nant Bacillus lipopeptides—were still present (Extended Data Fig. 3f). 
Culture filtrate of the fenA mutant strain was devoid of Agr-inhibiting 
activity, in contrast to that of the isogenic wild-type strain (Fig. 3f). 
We also measured an isogenic surfactin-negative mutant strain, which 
showed Agr-inhibiting activity similar to that of the wild-type strain 
(Fig. 3f). These results confirmed that fengycin production is the source 
of the observed Agr inhibition. 
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Fig. 4 | Competitive inhibition of S. aureus AIP activity by fengycins. 

a, Model of competitive Agr inhibition by fengycins. The agrBDCA operon 
(bottom right), whose expression is driven by the P2 promoter, encodes 

the AgrD precursor of the autoinducing-peptide (AIP), which is modified 
and secreted by AgrB. AIP binds to membrane-located AgrC, which, upon 
autophosphorylation, triggers phosphorylation and activation of the DNA- 
binding protein AgrA. In addition to stimulating transcription from the P2 
promoter (autoinduction), AgrA drives expression of RNAIII, which in turn 
regulates the expression of target genes such as those encoding CIfA, a-toxin 
and leukotoxins. RNAIII also encodes the 5-toxin. Furthermore, AgrA 

drives the expression of phenol-soluble modulins (PSMs) in an RNAIII- 
independent fashion. b, Structural similarity of fengycins with AIPs. The 
structures of 8-OH-C17-fengycin B and AIP-I are shown as examples. In 

red are structures and/or amino acids that may differ in different subtypes. 

c, Fengycins work by inhibiting AgrC. Shown is the inhibition of Agr by 
fengycin-containing Bacillus culture filtrate, using an agrBD-deleted S. aureus 
strain in which AgrC was stimulated by exogenously added AIP. *P< 0.0001 
(two-way ANOVA with Tukey’s post-test; values obtained in AagrBD/AIP 
versus AagrBD/control (no AIP), and AagrBD/AIP/culture filtrate versus 
AagrBD/AIP). d, Competitive titration of fengycin-mediated Agr inhibition 
by increasing amounts of AIP, as assayed by the Agr luminescence assay. RLU, 
relative light units. Statistical analysis is by two-way ANOVA with Tukey’s 
post-test versus control. e, Inhibition of Agr in different Agr-subtype S. aureus 
and S. epidermidis (strain 1457) by 8B-OH-C17-fengycin B, as measured by 
relative expression of 6-toxin using RP-HPLC/ESI-MS. Statistical analysis is 
by two-way ANOVA with Tukey’s post-test versus intensity values obtained 
without addition of fengycin. Values were calculated as percentages relative 
to intensity values obtained without addition of fengycin, owing to different 
§-toxin expression levels in the different strains. c-e, Experiments were 
performed with n =3 biologically independent samples. Data are mean +s.d. 
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Fig. 5 | Inhibition of S. aureus colonization by dietary fengycin- 
producing Bacillus spores in a mouse model. a, Experimental set-up. 
n=5 mice per group received 200,11 of 10° CFU ml"! S. aureus strain 
ST2196 F12 by oral gavage. On the next day and every following second 
day, they received 200 1l of 108 CFU ml’ spores of the B. subtilis wild- 
type (WT) or its isogenic fenA mutant, also by oral gavage. CFU in the 
faeces were determined two, four and six days after infection. At the end 
of the experiment (day seven), CFU in the small and large intestines 

were determined. The experiment was performed with (b) or without (c) 
antibiotic pretreatment. b, c, Experimental results. Statistical analysis was 
performed using Poisson regression versus values obtained with the B. 
subtilis WT spore samples. *P < 0.0001. Data are mean + s.d. Note that no 
S. aureus were found in the faeces or intestines of any mouse challenged 
with S. aureus and receiving Bacillus wild-type spores. The corresponding 
zero values are plotted on the x axis of the logarithmic scale. See Extended 
Data Fig. 5 for corresponding data obtained using strains USA300 LAC 
and ST88 JSNZ. 


Mechanism of fengycin- mediated inhibition 

In the S. aureus Agr quorum-sensing regulatory circuit, the secreted 
Agr autoinducing peptide (AIP) interacts with an extracellular 
domain of AgrC, the histidine kinase part of a two-component signal- 
transduction system, to signal the cell-density status? (Fig. 4a). 
Different Agr subgroups of S. aureus, as well as different staphylococ- 
cal species, produce distinct cyclic heptapeptide to nonapeptide AIPs*®. 
AIPs from other subgroups or species frequently inhibit Agr signal 
transduction by competitive inhibition at the AgrC-binding site?’ 
Given that fengycins, being cyclic lipopeptides, show structural 
similarity to AIPs (Fig. 4b), it appears likely that fengycins compete 
with the natural AIP for AgrC binding. The only other theoretically 
possible site of interference from the extracellular space would be the 
membrane-located AIP production/secretion enzyme AgrB. Using an 
S. aureus agrBD deletion strain and stimulation of AgrC by synthetic 
AIP, which led to complete Agr activation, we ruled out that the tar- 
get of Agr inhibition by Bacillus is AgrB (Fig. 4c). In further support 
of a mechanism that works through competition with AIP for bind- 
ing to the AgrC receptor, we found that fengycin inhibition could be 
reversed in a dose-dependent fashion by adding AIP (Fig. 4d). Finally, 
we determined the AIP concentration in early stationary growth 
phase (at 6-8 hours) to be about 11M (Extended Data Fig. 5a), which 
is approximately equal to the concentration of fengycin for which we 
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found complete Agr inhibition (Fig. 3d). These findings indicate that 
fengycins inhibit Agr signal transduction by efficient competitive inhi- 
bition as structural analogues of AIPs. 

The fact that AgrC-AIP interaction differs according to Agr sub- 
type raises the question of whether fengycins have a general ability to 
inhibit Agr. We found that purified 8-OH-C-17 fengycin B inhibited 
Agr in members of all S. aureus Agr subtypes, as well as in S. epider- 
midis (Fig. 4e). Furthermore, the S. aureus strains used in our mouse 
experiments belong to different Agr subtypes (strain USA300, type I; 
strain ST88, type III; strain ST2196, type I). These results indicate that 
fengycins have broad-spectrum Agr-inhibiting activity. 


Bacillus spores eradicate S. aureus 

To validate our findings in vivo and demonstrate the specific role of 
fengycins in the inhibition of S. aureus intestinal colonization, we com- 
pared the impact of the B. subtilis wild-type reference strain and its 
isogenic fenA mutant on S. aureus colonization in a mouse intestinal 
colonization model. We first performed a control experiment to ana- 
lyse the colonization kinetics of B. subtilis when given as spores, which 
corresponds to the form in which Bacillus would be taken up with food 
or probiotic formulae (Extended Data Fig. 5b). We observed transient 
colonization that strongly declined within two days. Importantly, col- 
onization by the B. subtilis fenA mutant was not different to that by the 
wild-type strain, ruling out the possibility that fengycin production as 
such affects B. subtilis colonization. 

Feeding mice B. subtilis spores completely abrogated colonization of 
all tested S. aureus strains in the faeces and intestines, in experimental 
set-ups with or without antibiotic pretreatment to eliminate the pre- 
existing microbiota. (Fig. 5b, cand Extended Data Fig. 5c-f). By contrast, 
spores of the fenA mutant had no notable effect on colonization of any 
S. aureus test strain. As Bacillus intestinal colonization in humans has 
been shown to reach much higher levels than that by S. aureus’—a 
situation likely to be even more pronounced in the tested rural 
population—our mouse data obtained with S. aureus numbers approxi- 
mately equal to or exceeding those of applied Bacillus spores suggest 
that fengycin-mediated interference in quorum sensing contributes to 
the exclusion of S. aureus colonization that we observed in humans. 


Conclusions 

Scientific evidence to support the frequent claims that probiotic nutri- 
ents improve human health is scarce. However, this study provides 
evidence for a molecular mechanism by which probiotic bacteria 
found in food could directly interfere with pathogen colonization. In 
particular, our data underscore the often-debated!®#2 probiotic value 
of B. subtilis. Notably, we found the responsible agents to work by quo- 
rum quenching, demonstrating that pathogen exclusion in the gut may 
work by inhibition of a pathogen signalling system. Furthermore, our 
findings emphasize the importance of quorum sensing for pathogen 
colonization. 

Our study suggests several valuable translational applications 
regarding alternative strategies to combat antibiotic-resistant S. aureus. 
First, the quorum-quenching fengycins—which previously had been 
known only for their antifungal activity*’—could potentially be used as 
quorum-sensing blockers in eagerly sought antivirulence-based efforts 
to treat staphylococcal infections!>*. Second, Bacillus-containing pro- 
biotics could be used for simple and safe S. aureus decolonization strat- 
egies. In that regard, it is particularly noteworthy that our human data 
indicate that probiotic Bacillus can comprehensively eradicate intestinal 
as well as nasal S. aureus colonization. Such a probiotic approach would 
have numerous advantages over the present standard topical strategy 
involving antibiotics, which is aimed exclusively at decolonizing the 
nose”. 


Online content 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and the investigators were not blinded to allocation during 
experiments and outcome assessment, except for when noted. 

Sample collection and bacterial screening. Nasal swabs and faecal samples were 
obtained from 200 Thai healthy volunteers from four different locations in southern, 
central, northeastern and northern Thailand. One sterile nasal swab, a sample 
collection tube, a sterile container and tissue paper were given to each participant. 
All participants provided informed written consent. The study was performed 
in compliance with all relevant ethical regulations and approved by the Siriraj 
Institutional Review Board (approval no. Si 733/2015). All participants were at least 
20 years old (age range 20-87 years; median age 57 + 14.5 years; 131 women and 
69 men) and without history of intestinal disease. None had received any antibiotic 
treatment or stayed at a hospital within at least three months before the study. 

Nasal swabs and faecal samples were streaked on mannitol salt agar (MSA) 
and then incubated at 37°C for 24 h. Positive or negative S. aureus or Bacillus 
colonization could easily be distinguished by either strong growth on the entire 
plate, or the absence of any colonies, respectively. At the time of this analysis, the 
purpose was to obtain and archive colonizing S. aureus strains. As the hypothesis 
regarding Bacillus/S. aureus exclusion was developed only after we obtained the 
results of this analysis, the staff performing the analysis were blinded as to the 
exclusion hypothesis. Isolates were easily recognized as S. aureus or Bacillus by 
colony morphology and colour; however, every isolate was confirmed for spe- 
cies identity using matrix-assisted laser desorption ionization time-of-flight mass 
spectrometry (MALDI-TOF MS; see below), and Bacillus species were further 
distinguished by 16S rRNA sequencing (Extended Data Table 1). To that end, 16S 
rRNA genes were amplified by the polymerase chain reaction (PCR) using primers 
27FB and 1492RAB* and similarity analysis with the basic local alignment search 
tool (BLAST) was used to identify the species. Subjects were considered as perma- 
nently colonized by S. aureus if two positive samples were obtained, tested after a 
four-week interval. All individuals tested either negative or positive for S. aureus 
at both times. In total, 105 Bacillus isolates from 101 individuals were analysed. 
In the samples from four individuals, two isolates each were taken owing to their 
apparent phenotypic differences. 

Bacterial identification using MALDI-TOF MS. Isolates were inoculated onto 
sheep blood agar and incubated for 24 h at 37°C. Bacterial colonies were applied 
onto a 96-spot target plate and allowed to dry at room temperature. Subsequently, 
2 tl of MALDI matrix (a saturated solution of a-cyano-4-hydroxycinnamic acid 
(HCCA) in 50% acetonitrile and 2.5% trifluoroacetic acid) was applied onto the 
colonies and allowed to dry before testing. Then the target plate was loaded into 
the MALDI-TOF MS instrument (MicroFlex LT mass spectrometer, Bruker 
Daltonics). Spectra were analysed using MALDI Biotyper automation control and 
the Bruker Biotyper 2.0 software and library (version 2.0, 3,740 entries; Bruker 
Daltonics). Identification score criteria used were those recommended by the 
manufacturer: a score of >2.000 indicated species-level identification; a score of 
1.700-1.999 indicated identification to the genus level; and a score of <1.700 was 
interpreted as no identification. Isolates that failed to produce a score of <1.700 
with direct colony or extraction methods were retested. S. aureus ATCC25923, 
E. coli ATCC25922 and Pseudomonas aeruginosa ATCC27853 were used as 
controls. 

Bacterial strains and growth conditions. The reference B. subtilis strain and par- 
ent of the fenA and srfA mutants used in this study was strain ZK3814 (genotype 
NCIB3610). The S. aureus strains used in all experiments (except the experiment 
in which we analysed different Agr-subtype S. aureus) were: first, the human fae- 
cal isolate F12 of ST2196 (Supplementary Table 1); second, strain JSNZ of ST88, 
a mouse isolate previously described as mouse adapted*; and third, strain LAC 
of pulsed-field type USA300, an MRSA lineage predominantly involved in com- 
munity-associated infections, but now generally representing the major lineage 
responsible for S. aureus infections in the United States*”. 

Isogenic mutants in agr were previously described (for strain LAC)** or pro- 
duced in this study (for strains JSNZ and F12) by phage transduction of the agr 
deletion from strain RN6911. The agr system is entirely deleted in these strains, 
except for a 3’ part of RNAIII, which is not transcribed owing to the absence of the 
corresponding promoter. All mutants were verified by analytical PCR. 

Owing to the tetracycline resistance introduced in the agr deletion strains, kana- 
mycin derivatives (pKX,) of the pTX, expression plasmid series were constructed 
and used for complementation of Agr. (This was not possible in strain LAC, which 
harbours resistance to multiple antibiotics, including kanamycin.) To that end, we 
treated plasmid pKX15*°—provided by B. Krismer, University of Tiibingen—as 
described** to delete the xyIR repressor gene, in order to make expression of any 
fragment cloned under control of the xy! promoter constitutive. To obtain plasmid 
pKX,RNAIII, the RNAIII BamH1-Mlul fragment was transferred from pTXaR- 
NAIII°°. Plasmid pKX, 16 is the corresponding empty control plasmid, derived 
from pKX16 by analogous deletion of the xyIR repressor gene. 
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To construct the agrBD deletion mutant of strain LAC P3-lux, we used a 
4.8-kilobase PCR product from USA300 genomic DNA that included the agrBDCA 
operon as well as 1 kb upstream and 1 kb downstream; we cloned this product into 
the Smal site of plasmid pIMAY*! and used inverse PCR to delete agrBD. Allelic 
exchange was then performed, and the chromosomal deletion was confirmed by 
PCR using one primer outside of the 1-kb homology arm, followed by sequencing 
of the PCR product. See Supplementary Table 2 for the oligonucleotides used. 

To construct the tetracycline-resistant derivatives of S. aureus ST88 and ST2196, 
we carried out (p11-phage-mediated transduction as described in order to transfer 
the tetracycline cassette in the donor strain (S. aureus RN4220 with integrated 
pLL29) to S. aureus strains ST88 and ST2196°”. 

To construct the B. subtilis fengycin mutant strain, SPP 1-phage-mediated trans- 
duction® was performed to transfer the fenA deletion present in the donor strain 
(BKE18340, a fenA(ppsA)::erm mutant in B. subtilis strain 168 obtained from the 
Bacillus Genetic Stock Center) to B. subtilis strain ZK3814. This was necessary 
as B. subtilis strain 168 bears a mutation in the sfp gene, abolishing lipopeptide 
production. 

Bacteria were generally grown in tryptic soy broth (TSB) with shaking unless 
otherwise indicated. 

Typing of S. aureus isolates. S. aureus isolates were typed by MLST as described™. 
PCR amplicons of seven S. aureus housekeeping genes (arcC, aroE, glpF, gmk, 
pta, tpi and yqiL) were obtained from chromosomal DNA and their sequences 
compared with those available from the PubMLST database (https://pubmlst. 
org/saureus/). Previously undescribed alleles (arcC 520-521 and gmk 337) and 
sequence types (ST4630-ST4638) were deposited to the website. The Agr subtype 
of S. aureus isolates was determined using a modified multiplex quantitative reverse 
transcription PCR (qRT-PCR) protocol’. Two duplex qRT-PCR protocols, using 
the respective described primer sets and two coloured probes each, were set up for 
Agr types I and II, and III and IV, respectively. Isolates for which the Agr type could 
not be determined by that method were analysed for the type of AIP production 
using RP-HPLC/ESI-MS with the chromatography method also used for PSM 
detection (see below), integrating the three major m/z peaks for each AIP type. 
Microbiome analysis. Genomic DNA from each faecal sample was extracted using 
a QIAamp DNA stool Minikit (Qiagen) according to the manufacturer's instruc- 
tions. The DNA was quantified using a Nanodrop spectrophotometer, and 16S 
rRNA paired-end sequencing of the V4 region of 16S rRNA was performed by 
Illumina using an Illumina MiSeq system as described™. 

For all obtained paired-end sequences, the abundance of operational taxo- 
nomic units (OTUs) and alpha and beta diversity were identified using quantita- 
tive insights into microbial ecology (QIIME 1.9.1)°”. This study used the Nephele 
(release 1.6) platform from the National Institute of Allergy and Infectious Diseases 
(NIAID) Office of Cyber Infrastructure and Computational Biology (OCICB) in 
Bethesda, Maryland, USA. The sequences were assigned to OTUs with the QIIME’s 
uclust-based** open-reference OUT-picking protocol®’ and the Greengenes 13_8 
reference sequence set™ at 99% similarity. Alpha diversity was calculated using 
Chaol and Shannon analyses®! and compared across groups using a non-para- 
metric t-test with 999 permutations. 

Growth-inhibition analysis. Growth inhibition of S. aureus by Bacillus culture 
filtrates was tested with an agar diffusion assay. To that end, 10 il of Bacillus culture 
filtrate from each isolate was spotted on sterile filter disks. The filters were left to 
dry and the procedure was repeated four times, after which filters were laid on 
agar plates containing S. aureus, resulting in the analysis of five-times concentrated 
culture filtrate. 

Fengycin purification. To identify the Agr-inhibiting active substance, 10 ml of 
culture filtrate from the B. subtilis reference strain grown for 48 h in TSB were 
applied to a Zorbax SB-C18 9.4 mm x 25 cm reversed-phase column (Agilent) 
using an AKTA Purifier 100 system (GE Healthcare). After washing with three 
column volumes of 100% buffer A (0.1% trifluoroacetic acid (TFA) in water) and 
five column volumes of 30% buffer B (0.1% TFA in acetonitrile), a 20-column 
volume gradient from 30% to 100% buffer B was applied. The column was run at 
a flow rate of 3 ml min7’. Peak fractionation was performed using the absorbance 
at 214 nm, and fractions were subjected to further analysis by RP-HPLC/ESI-MS 
and MS/MS and tested for Agr inhibition (see below). 

To purify larger amounts of the main active peak containing 8-OH-C17-fengycin 
B, we added acetonitrile to 200 ml filtrate from cultures grown under the same 
conditions to a final concentration of 10%; precipitated material was removed 
by centrifugation for 10 min at 3,700gusing a Sorvall Legend RT centrifuge, and 
the obtained cleared supernatant was applied to a self-packed HR 16/10 column 
filled with Resource PHE (GE Healthcare) material (column volume 17 ml). After 
sample application, the column was washed with 10% buffer B for three column 
volumes and 25% buffer B for five column volumes, after which a gradient of 
15 column volumes from 25% to 60% buffer B was applied. We collected 10-ml 
fractions and lyophilized positive fractions (as determined by RP-HPLC/ESI-MS). 
The lyophilisate was redissolved in 2 ml acetonitrile. We added 6 ml of water and 
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removed the precipitated material through a 5-min centrifugation in a table-top 
centrifuge at maximum speed. The cleared supernatant was then further purified 
ona Zorbax SB-C18 9.4 mm x 25 cm reversed-phase column as described above. 
PSM and lipopeptide detection by RP-HPLC/ESI-MS. PSMs were analysed by 
RP-HPLC/ESI-MS using an Agilent 1260 Infinity chromatography system coupled 
to a 6120 Quadrupole LC/MS in principle as described”, but with a shorter column 
and a method that was adjusted accordingly. A 2.1 mm x 5 mm Perkin-Elmer SPP 
C8 (2.7\1m) guard column was used at a flow rate of 0.5 ml min |. After sample 
injection, the column was washed for 0.5 min with 90% buffer A and 10% buffer 
B, then for 3 min with 25% buffer B. Next, an elution gradient was applied from 
25% to 100% buffer B in 2.5 min, after which the column was subjected to 2.5 min 
of 100% buffer B to finalize elution. 

Bacillus culture filtrates or (partially) purified fractions containing lipopep- 

tides (fengycins and surfactins) were analysed using the same column, system 
and elution conditions. To quantify the production of different fengycins, we used 
the two most abundant peaks, corresponding to double- and triple-charged ions, 
for the integration. Agilent mass hunter quantitative analysis version B.07.00 was 
used for quantification. 
Measurement of Agr activity. To determine the Agr-inhibiting activity of Bacillus 
culture filtrates or purified fractions, we measured luminescence emitted by an 
Agr P3 promoter-luxABCDE reporter fusion construct that was inserted into the 
genome of S. aureus strain LAC™*. Strain LAC P3-luxABCDE was diluted 100-fold 
from a preculture grown overnight in TSB before distribution into a 96-well micro- 
titre plate. To 10011 of that dilution, we added 10011 of sterilized culture filtrate 
sample, unless otherwise indicated. Plates were incubated at 37°C with shaking. 
Luminescence was measured with a GloMax Explorer luminometer (Promega) 
every 2h for a total of 6 h. Inhibition was considered significant if the 4-h sample and 
control values differed by at least a factor of two. Of note, the quorum-quenching 
effect exerted by the one-time initial dose of fengycin or fengycin-containing 
culture filtrates was transient and was overcome at later times by the increasing 
intrinsic AIP production. The Agr-inhibiting activity of purified fengycin was also 
measured using quantitative real-time PCR of RNAIII as described*”. 

To determine the Agr-inhibiting activity with target strains other than LAC (Agr 
subtype I), we measured the production of 6-toxin, for which the gene is embedded 
in the Agr intracellular effector RNA, RNAIII, in most staphylococci. Production of 
5-toxin was measured by RP-HPLC/ESI-MS as described above. Strains LAC (Agr 
subtype I), A950085 (Agr subtype II), MW2 (Agr subtype III) and A970377 (Agr 
subtype IV) were used for testing the effect of 83-OH-C17-fengycin B on S. aureus 
of different Agr subgroups. Strain 1457 was used for S. epidermidis. All strains were 
diluted 100-fold from a preculture grown in TSB. }-OH-C17-fengycin B dissolved 
in dimethylsulfoxide (DMSO) was added to each sample to a final concentration 
of 201M and 100,.M. All samples were incubated at 37°C with shaking for 4 h. 
Samples were centrifuged and supernatant was collected for RP-HPLC/ESI-MS 
detection. 

Analysis of PVL and «-toxin expression. S. aureus strain LAC was diluted 100- 
fold from a preculture grown in TSB and inoculated into 500 jl TSB. Then, 250 tl 
of B. subtilis culture filtrate was added into the sample. Samples were incubated at 
37°C with shaking for 4 h. Samples were centrifuged in a table-top centrifuge at 
maximum speed for 5 min; the supernatants were collected and loaded onto 12% 
SDS-polyacrylamide gel electrophoresis (PAGE) gels, which were run at 160 V for 
1h, Proteins were transferred to nitrocellulose membranes using an iBlot western 
blotting system. Membranes were incubated with Odyssey blocking buffer for 1 h at 
room temperature. Anti-c-toxin antibodies (polyclonal rabbit serum; Sigma $7531; 
dilution 1:5,000) or anti-LukF-PV antibodies (affinity-purified rabbit IgG specific 
for a peptide region of LukF-PV, produced by GenScript USA and provided by 
E DeLeo, NIAID; dilution 1:500) were added to the blocking buffer and membranes 
were incubated overnight at 4°C. Then, membranes were washed five times with 
Tris-buffered saline containing 0.1% Tween-20, pH 7.4, and incubated with Cy5- 
labelled goat anti-rabbit IgG (diluted 1:10,000 in blocking buffer) in the dark for 
1 h at room temperature. Membranes were washed five times with the washing 
buffer and scanned with a Typhoon TRIO-+ variable mode imager. 

Preparation of Bacillus spores. B. subtilis wild-type or isogenic fengycin mutant 
strains were inoculated from a preculture (1:100) into 1 litre of 2x SG medium® 
and allowed to sporulate for 96 h. Cells were pelleted, washed with water, and 
resuspended in 20% metrizoic acid (Sigma). Five different concentrations (w/v) 
of metrizoic acid (60% to 20%) were added stepwise to a 50-ml centrifuge tube to 
obtain a density gradient. A cell suspension was added to the top of the gradient, 
and was followed by centrifugation at 40,000g for 60 min at 4°C (as described 
previously™). Spores were found in the middle layers and were collected. They were 
washed three times with 10 ml water. The total obtained number of viable spores 
per ml was determined by serial dilution, plating on TSA, and counting of CFU. 
The total number of heat-resistant spores per ml was determined by submerging 
the spores in a water bath at 80°C for 20 min, followed by serial dilution and 
quantification of CFU per ml as described above. 
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Mouse intestinal colonization model. In vivo studies were approved by the 
Institutional Animal Care and Use Committee of the NIAID. Animal work 
was conducted by certified staff in a facility accredited by the Association for 
Assessment and Accreditation of Laboratory Animal Care (AAALAC). All of the 
animal work adhered to the institution's guidelines for animal use and followed 
the guidelines and basic principles in the US Public Health Service Policy on 
Humane Care and Use of Laboratory Animals, and the Guide for the Care and 
Use of Laboratory Animals. 

All C57BL/6] mice were female and six to eight weeks of age at the time of use. 
In one set-up, before S. aureus was given by oral gavage, mice were pretreated to 
eradicate the pre-existing intestinal microbiota using an antibiotic mix consisting 
of ampicillin (1 g1~), metronidazole (1 g1~'), neomycin trisulfate (1 g1~!) and 
vancomycin (1 g1~') in the drinking water. The last day before gavage, antibiotics 
were omitted from the drinking water. No bacteria could be found in the faeces 
or intestines of mice for seven days after this treatment in a control experiment. 
In another set-up, antibiotic pretreatment was omitted. In all set-ups, S. aureus 
strains were grown to mid-exponential growth phase, washed, and resuspended 
in sterile phosphate-buffered saline (PBS) at 108 CFU ml !. Mice were inoculated 
by oral gavage with 200 1l ofa 108 CFU ml suspension of the indicated S. aureus 
strains, or 1:1 mixtures of wild-type and isogenic agr mutants to reach the same 
final concentration and volume. For the experiments with strains containing plas- 
mids of the pKX, type, mice received kanamycin (0.2 g 1-1) in the drinking water 
during the experiment to maintain plasmids. For the B. subtilis spore competition 
experiment, oral gavage with 200 il of spores of wild-type Bacillus or its isogenic 
AfenA fengycin mutant (10° CFU ml! in sterile PBS) was performed on the day 
following the S. aureus gavage, and repeated every second day thereafter for a 
total of three times (days 2, 4 and 6). Intestinal colonization was evaluated by 
quantitative cultures of mouse stool samples and samples from the small and large 
intestines of mice. In detail, stool was collected and suspended to a final volume 
of 1 ml of PBS, diluted and plated on TSB agar. Plates were incubated for 24 h at 
37°C, and colonies were enumerated. Moreover, after mice were euthanized seven 
days after infection, the small and large intestines were collected, resuspended each 
in 1 ml PBS, and homogenized. Serial dilutions of the homogenates were plated 
on TSB agar and incubated at 37°C. Bacterial colonies were enumerated on the 
following day. In the experiments without antibiotic pretreatment, extracts were 
plated on MSA plates containing 4j1g ml“! oxacillin (for strain USA300 LAC) or 
3g ml! tetracycline (for tetracycline-resistant derivatives of strains ST88 and 
ST2196), incubated for 48 h at 37°C, and enumerated. 
Statistics. Statistical analysis was performed using GraphPad Prism version 6.05 
with one-way or two-way ANOVA, or Fisher's exact test, as appropriate, except for 
the experiments shown in Figs. 2c, 5b, c, and Extended Data Figs. 2b, c, 5c-f, for 
which Stata Release 15 and Poisson regression were used, owing to the exclusive 
presence of 0 values in one group (no variance). For ANOVAs, Tukey post-tests 
were used, which correct for multiple comparisons using statistical hypothesis 
testing. All data show the mean and standard deviation (s.d.). All replicates are 
biological. 
Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability 

Microbiome sequencing data are available from Bioproject with accession number 
483343. All other data generated or analysed during this study are included in the 
published Article or in the Supplementary Information. 
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Extended Data Fig. 1 | Microbiome analysis of S. aureus carriers versus species. c, Chaol index. d, Comparison of relative taxa abundance between 
non-carriers. The microbiota of n = 20 randomly selected S. aureus S. aureus carriers (red) and non-carriers (blue). e, f, Beta diversity, 
carriers (red) and n= 20 non-carriers (blue) were analysed in faecal represented by a principal coordinate analysis plot based on unweighted 
samples. a-c, Rarefaction (species-richness) curves based on 16S rRNA UniFrac (e) and weighted UniFrac (f) metrics for samples from S. aureus 
gene sequences. Data are mean + s.d. a, Shannon index. b, Observed carriers (red) and non-carriers (blue). 
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Extended Data Fig. 2 | Quorum-sensing dependence of S. aureus 
intestinal colonization. Data from strains USA300 LAC and ST88 JSNZ. 
The experimental set-up is the same as in Fig. 2: mice received by oral 
gavage either 100 1l containing 10® CFU ml! of wild-type S. aureus 
strain USA300 LAC or ST88 JSNZ plus another 100 il of 108 CFU ml“! 
of the corresponding isogenic agr mutant (n = 5 per group; competitive 
experiment shown in a, b); or 20011 containing 10° CFU ml! wild- 
type, isogenic agr mutant or Agr (RNAIII)-complemented agr mutant 
(n=5 per group, non-competitive experiment shown in c). CFU in the 
faeces were determined two, four and six days after infection. At the end 
of the experiment (day seven), CFU in the small and large intestines 
were determined. a, b, Competitive experiment. Total obtained CFU are 
shown as dot plots; also shown are mean + s.d. Bars show the percentage 
of wild-type among total determined CFU, of which 100 were analysed 
for tetracycline resistance that is present only in the agr mutant. No agr 
mutants were detected in any experiment; thus, all bars show 100%. Given 
that 100 isolates were tested, the competitive index wild-type/agr mutant 
in all cases is > 100. c, Non-competitive experiment with genetically 
complemented strains. Wild-type and isogenic agr mutant strains all 
harboured the pKX, 16 control plasmid; Agr-complemented strains 
harboured pKX,RNAIII and thus constitutively expressed RNAIII, which 
is the intracellular effector of Agr. During the experiment, mice received 
200 jg ml~' kanamycin in their drinking water to maintain plasmids. 
Statistical analysis was performed using Poisson regression versus values 
obtained with the agr mutant strains. *P < 0.0001. Data are mean + s.d. 
Note that no bacteria were found in the faeces or intestines of any mouse 
receiving S. aureus Aagr with vector control. The corresponding zero 
values are plotted on the x axis of the logarithmic scale. 
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Extended Data Fig. 3 | See next page for caption. 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


Extended Data Fig. 3 | Analysis of Agr-inhibitory substances. 

a, Influence of heat and proteases on Agr inhibition. B. subtilis culture 
filtrate was subjected to heat (95°C for 20 min) or digestion with 
proteinase K (501g ml~!, 37°C, 1h) and the effect on inhibition of Agr 
activity was measured using the luminescence assay with the USA300 
P3-luxABCDE reporter strain (see Fig. 3a). RLU, relative light units. The 
experiment was performed with n= 2 independent biological samples. 
Lines connect the means. (The observed additional suppression of Agr 
activity in the proteinase-K-treated sample at 6 h, compared with the 
B. subtilis culture filtrate sample, is expected owing to proteolytic 
inactivation of intrinsic AIP.) b, Preparative RP chromatography of 

B. subtilis culture filtrate to determine the Agr-inhibiting substance. 
The peaks labelled 2 and 3 showed substantial Agr-inhibiting activities 
in the Agr-activity assay and were identified as fengycins using 
subsequent RP-HPLC/ESI-MS and MS/MS analysis (see c, d). The 
peaks labelled 1 and 4-6 also contained fengycin species (see e). AU, 
arbitrary units. The applied gradient (% buffer B) is shown in green. 


c, Fractions corresponding to Agr-inhibitory peaks 2 and 3 from the 
preparative RP run (b) were subjected to RP-HPLC/ESI-MS. Top, total 

ion chromatograms (TICs) of the RP-HPLC/ESI-MS runs; bottom, ESI 
mass spectrogram of the major peaks. d, MS/MS analysis of the peak 2 

and 3 fractions. Peaks that are characteristic of a given fengycin subtype 
(A or B in this case) are marked in colour. ‘Parent’ refers to the relevant 
numbered peak in the spectrograms above. e, Analysis of further fengycin- 
containing fractions. Peaks 1, 4, 5 and 6 from the preparative RP run (b) 
were also found to contain fengycin species as determined by subsequent 
RP-HPLC/ESI-MS analysis. Shown are the mass spectrograms of the major 
peaks of those runs and the tentative characterization for fengycin type. 
The preparative and analytical chromatography and RP-HPLC/ESI-MS 
analyses (as shown in b, d) were repeated multiple (more than ten) times 
for fengycin purification, with similar results. MS/MS analyses were not 
repeated. f, Analysis of fengycin and surfactin lipopeptide expression by 
the B. subtilis wild-type strain and its isogenic AfenA mutant. 
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Comparisons shown are those versus DMSO control. c, Agr inhibition as 
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Extended Data Fig. 5 | Inhibition of S. aureus colonization by dietary 
fengycin-producing Bacillus spores in a mouse model. a, Concentration 
of AIP-I during S. aureus growth. Strain LAC (USA300) was grown in 
TSB, and AIP-I concentrations were measured by RP-HPLC/ESI-MS. 
Calibration was performed using synthetic AIP-I. The detection limit 

of this assay is around 0.3 1M. The experiment was performed with 

n= 3 independent biological samples. Data are mean + s.d. b, B. subtilis 
colonization kinetics in the mouse intestinal colonization experiment. 
Mice (n=5) received 20011 of a 10® CFU ml“! suspension of wild-type 
B. subtilis or AfenA mutant spores by oral gavage; CFU in the faeces 
were analysed up to five days afterwards. Data are mean + s.d. 

c-f, Inhibition mouse model with strains USA300 LAC and ST88 JSNZ. 
The experimental set-up was as shown in Fig. 5a. In brief, n =4 or 5 mice 
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per group received 200 il of 10° CFU ml"! S. aureus strains USA300 

LAC or ST88 JSNZ by oral gavage. On the next day and every following 
second day, the mice received 200 il of 10° CFU ml’ spores of wild-type 
B. subtilis or its isogenic fenA mutant, also by oral gavage. CFU in the 
faeces were determined two, four and six days after infection. At the end 
of the experiment (day seven), CFU in the small and large intestines were 
determined. The experiment was performed with (c, d) or without (e, f) 
antibiotic pretreatment. Statistical analysis was performed using Poisson 
regression versus values obtained with wild-type B. subtilis spore samples. 
*P < 0.0001. Data are mean +s.d. Note that no S. aureus were found in the 
faeces or intestines of any mouse challenged with any S. aureus strain that 
also received Bacillus wild-type spores. The corresponding zero values are 
plotted on the x axis of the logarithmic scale. 
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Extended Data Table 1 | Fengycin production and Agr-inhibition potency of Bacillus faecal isolates 
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55 subtilis 45 43 113 12 94 0 i?) 216 96 523 
56 sonorensis 167 53 0 0 83 201 109 58 98 671 
57 subtilis 127 104 166 301 61 96 98 246 96 1200 
58 sonorensis 498 510 841 0 967 222 ie) ie} 94 3039 
59 subtilis 0 0 1,008 0 0 297 715 488 82 2508 
61 subtilis 124 84 21 0 160 39 42 335 97 805 
62 licheniformis 93 128 7 48 40 188 0 153 oF 657 
63 amyloliquefaciens 4,153 380 975 0 2,584 1,047 1,975 1,084 95 12197 
64 sonorensis 133 91 187 230 29 387 62 34 95 1154 
65 subtilis 9 126 0 0 35 90 239 36 97 535 
66 sonorensis 254 0 0 0 377 0 131 0 oF 762 
67 subtilis 215 84 200 0 0 0 155 168 97 821 
68 subtilis 41 12 144 ie} ie) 0 115 182 93 494 
69 subtilis 266 27 236 290 132 0 54 195 98 1200 
70 subtilis 54 55 390 0 196 0 187 112 94 994 
71 subtilis 14,881 4,578 88,106 34,879 39,939 16,967 42,502 23,859 9o7 265710 
TA pumilus 157 56 54 14 0 62 177 Al 95 560 
7 licheniformis 281 22 40 i} 0 292 125 204 95 964 
76 subtilis 124 0 74 0 0 101 82 192 97 573 
77 sonorensis 0 0 0 25 0 0 93 13 97 131 
78 amyloliquefaciens 73 99 0 0 79 17 176 13 94 458 
79 amyloliquefaciens 10 ie} 0 0 0 180 51 63 97 304 
80 subtilis 1,741 322 4,222 1,105 3,327 910 2,207 1,529 91 15363 
81 subtilis 1,739 426 5,073 1,579 3,371 998 3,241 1,933 87 18361 
82 subtilis 1,002 i) 3,413 921 1,998 i) 1,710 992 91 10037 
83 licheniformis 356 0 536 4 83 201 107 52 o7 1338 
85 subtilis 52 0 49 0 84 161 i) 132 95 AT9 
87 subtilis 1,327 312 3,931 O 2,763 667 2,624 1,167 98 12790 
88 pumilus 101 0 215 156 313 367 276 216 96 1643 
89 subtilis 105 59 266 0 0 0 51 38 93 519 
91 subtilis 325 91 493 dg 120 290 302 186 98 1825 
92 subtilis 254 0 234 156 275 0 154 66 96 1140 
93 subtilis 493 273 91 342 204 287 277 435 96 2402 
94 subtilis 876 115 134 37 384 190 445 529 91 2712 
95 amyloliquefaciens 351 175 157 146 110 225 196 197 97 1557 
o7 subtilis 1,845 912 4,714 1,310 3,686 1,492 2,484 1,557 86 18000 
98 subtilis 1,367 804 3,572 1,512 2,803 1,511 2,005 1,626 93 15200 
99 subtilis 77 375 705 i} 28 170 115 77 91 1547 
100 amyloliquefaciens 81 117 337 267 105 166 237 502 81 1811 
103 pumilus 249 45 350 207 162 249 536 279 98 2077 
104 subtilis 293 77 105 269 75 509 286 17 99 1632 
106 subtilis 978 796 4415 3,419 1,874 1,378 2,478 1,527 97 16866 
107 subtilis 423 199 322 150 520 517 384 160 98 2675 
108 pumilus 224 114 397 ie} 229 211 413 43 99 1631 
110 subtilis 140 77 317 286 404 127 117 139 95 1607 
111 pumilus 184 104 319 67 96 212 294 120 93 1395 
112 subtilis 470 183 1,412 637 1,211 732 950 655 96 6251 
113 pumilus 463 202 276 204 211 0 156 137 95 1650 
115 subtilis 268 232 297 62 313 410 713 88 o7 2382 
116 subtilis 352 205 369 0 172 298 561 350 96 2306 
117 subtilis 143 104 716 328 0 149 oF 266 98 1803 
118 subtilis 163 34 1,788 0 0 155 306 42 93 2488 
119 amyloliquefaciens 604 256 947 258 0 350 361 104 93 2880 
121 subtilis 503 151 ie) 364 63 84 174 162 98 1502 
122 subtilis 152 311 24 165 86 213 296 83 91 1329 
123 subtilis 8,801 3,540 23,045 10,778 16,465 6,428 16,077 10,706 98 95839 
124 subtilis 106 139 316 168 64 175 195 103 86 1266 
125 subtilis 0 0 0 158 0 318 290 0 95 765 
126 subtilis 288 157 211 110 428 421 185 112 87 1913 
127 subtilis 478 193 103 435 240 303 551 132 96 2434 
128 subtilis 177 156 228 118 96 276 426 48 97 1525 
129 pumilus 249 37 oO 162 0 224 0 144 97 816 
130 subtilis 1,267 0 4,668 1,266 2,940 1,168 2,158 1,863 88 15332 
131 subtilis 4,164 496 513 307 1,226 671 1,948 662 83 9986 
132 subtilis 441 291 491 391 200 562 381 279 95 3036 
134 subtilis 6,647 750 1,011 0 3,143 1,166 2,829 953 97 16498 
136 subtilis 751 233 2,569 773 1,813 890 1,701 1,010 89 9740 
137 subtilis 1,572 328 3,297 1,036 2,221 627 2,266 0 82 11347 
138 pumilus 288 311 232 11 A415 236 236 403 85 2133 
139 subtilis 1,898 709 7,830 2,453 5,328 2,073 3,880 2,210 98 26381 
140 sonorensis 0 106 217 815 225 0 103 3,987 91 5454 
141 thuringiensis 258 26 325 230 0 264 0 0 87 1102 
142 pumilus 422 159 262 ie} 402 335 639 251 87 2471 
143 subtilis 250 37 210 16 64 68 293 89 83 1027 
144 sonorensis 110 208 134 191 0 304 361 91 87 1399 


The table shows intensity values from the integration of m/z peaks associated with the specific fengycin species, as obtained by RP-HPLC/ESI-MS. The two most abundant peaks, corresponding to 
double- and triple-charged ions, were used for the integration. Values are in nM, obtained by calibration using weighed and diluted aliquots of the Bacillus lipopeptide surfactin. 

«Bacillus species were determined by sequencing 16S RNA encoding DNA, as specified in the Methods. 

The percentage of Agr inhibition was determined by dividing the 4-h value obtained in the luminescence assay for the sample (using 1001 of culture filtrate) by that obtained for the control, and 
multiplying by 100. 
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Extended Data Table 2 | Analysis of previous microbiome studies for correlation between the presence of S. aureus and B. subtilis in the 
human intestinal tract 


Study ID Study Name Samples Only B. subtilis Only S. aureus Both Neither 
ERP012803 American Gut Project 6635 1 (0.015%) 304 (4.58%) 0 6330 (95.4%) 
ERP011001 Human gut bacteria that rescue growth 1732 408 (23.61%) 70 (4.05%) 71 (4.11%) 1179 (68.23%) 


and metabolic defects transmitted by 
microbiota from undernourished children 


ERP005437 16S sequencing of Malawian children 1515 118 (7.79%) 6 (0.4%) 4 (0.26%) 1387 (91.55%) 

SRP049113 Human gut microbiota from the ALADDIN 664 2 (0.30%) 61 (9.19%) 7 (1.05%) 594 (89.46%) 
study 

ERP019564 Role of Gut Microbiota in Pathophysiology 481 8 (1.66%) 7 (1.45%) 0 466 (96.88%) 
of Parkinson's Disease 

SRPO73172 DNA from FIT can replace stool for 408 63 (15.44%) 71 (17.40%) 99 (24.26%) 175 (42.89%) 
microbiota-based colorectal 

SRP068240 Human feces metagenome 16s rDNA 350 52 (14.85%) 189 (54%) 89 (25.43%) 20 (5.71%) 
sequencing 

SRP064846 Homo sapiens fecal microbiome transplant 271 20 (7.38%) 47 (17.34%) 6 (2.21%) 198 (73.06%) 

SRP065497 Human gut environment Targeted loci 270 54 (20%) 8 (2.96%) 19 (7.04%) 189 (70%) 
environmental 

ERP021093 Gut microbiome from patients obtained by 268 88 (32.84%) 14 (5.22%) 57 (21.27%) 109 (40.67%) 
16s rRNA sequencing. 

ERP010229 Gut microbial succession follows acute 260 12 (4.62%) 92 (35.38%) 122 (46.92%) 34 (13.08%) 
secretory diarrhea in humans 

ERP010458 Gut microbiota of stroke patients 233 3 (1.29%) 32 (13.73%) 4 (1.72%) 194 (83.26%) 


differentiates from healthy controls 


We included in our analysis all studies found on the EB] Metagenomics website (https://www.ebi.ac.uk/metagenomics/) that had more than 200 participants (independent samples) and which used 
IIlumina Miseq instruments. We pooled raw 16S rRNA sequencing data from the EB] Metagenomics website, and used taxonomic assignment (TSV) files for analysis. The number of sequence reads 
was used to analyse how many samples contained S. aureus or B. subtilis. Samples with a read number of more than O were defined as colonized. When there were no reads, samples were designated 
as noncolonized. 
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Transcription factor dimerization 
activates the p300 acetyltransferase 


Esther Ortega!, Srinivasan Rengachari>®, Ziad Ibrahim!?, Naghmeh Hoghoughi?’, Jonathan Gaucher), 


Alex S. Holehouse’, Saadi Khochbin? & Daniel Panne)? 


The transcriptional co-activator p300 is a histone acetyltransferase (HAT) that is typically recruited to transcriptional 
enhancers and regulates gene expression by acetylating chromatin. Here we show that the activation of p300 directly 
depends on the activation and oligomerization status of transcription factor ligands. Using two model transcription 
factors, IRF3 and STAT1, we demonstrate that transcription factor dimerization enables the trans-autoacetylation of 
p300 in a highly conserved and intrinsically disordered autoinhibitory lysine-rich loop, resulting in p300 activation. 
We describe a crystal structure of p300 in which the autoinhibitory loop invades the active site of a neighbouring HAT 
domain, revealing a snapshot of a trans-autoacetylation reaction intermediate. Substrate access to the active site involves 
the rearrangement of an autoinhibitory RING domain. Our data explain how cellular signalling and the activation and 
dimerization of transcription factors control the activation of p300, and therefore explain why gene transcription is 


associated with chromatin acetylation. 


Signals that emanate from cellular receptors ultimately lead to changes 
in gene expression that drive cellular change and organismal develop- 
ment. Gene expression is typically controlled through the coordinated 
activity of DNA-binding transcription factors, chromatin regulators 
and the general transcription machinery. For instance, in the innate 
immune system, pattern recognition receptors recognize and engage 
with various pathogen-associated molecular patterns’, and sub- 
sequently bind to adaptor proteins such as STING (stimulator of inter- 
feron genes). These adaptor proteins engage the latent DNA-binding 
transcription factor interferon (IFN) regulatory factor 3 (IRF3) and 
enable recruitment and activation of the non-canonical IB kinase 
TBK1!. TBK1 then phosphorylates IRF3 in a C-terminal motif, result- 
ing in the removal of autoinhibition, dimerization and adaptor dis- 
placement”. Activated IRF3 dimers bind to p300/CBP (where CBP 
is CREB-binding protein; p300 and CBP are also known as KAT3B 
and KAT3A, respectively) to stimulate chromatin acetylation and gene 
expression of the antiviral type I IFNs IFNo and IFNG*>°. Type I IFNs 
are secreted and bind to specific cell-surface IFN receptors, which 
results in the activation of Janus kinase-signal transducers and activa- 
tors of transcription (JAK-STAT) signalling®. The activated, tyrosine- 
phosphorylated STATs then dimerize, translocate to the nucleus and 
bind to p300/CBP to stimulate the transcription of IFN-stimulated 
genes’, 

p300/CBP are known to interact with more than 400 binding part- 
ners including the basal transcription machinery’. The large protein 
interactome of p300/CBP results in near-universal recruitment of these 
HATs to enhancers, and p300 occupancy has been used to identify 
enhancers genome-wide”"”. p300/CBP catalyses the acetylation of 
histone H3K27 to form H3K27ac, a modification that is considered 
an ‘activation mark!!. However, recruitment of p300/CBP does not 
always correlate with gene activation and is occasionally associated 
with repression!*~!®. A large number of chromatin regions that bind 
p300/CBP therefore do not contain this canonical H3K27ac modifi- 
cation, which indicates that HAT activity at such sites is blocked!>!”, 


Therefore, it is a major challenge to understand the mechanism that 
enables switching between inactive and active states of p300/CBP on 
enhancers, and to causally link cellular signalling to the recruitment 
of p300/CBP, the regulation of HAT activity and the establishment of 
repressed, poised and active chromatin. 

Here we have investigated how the activation and oligomerization 
status of p300 transcription factor ligands such as IRF3 and STAT1 
affects the catalytic activity of p300. We found that the kinase-activated 
and dimeric, but not the inactive or monomeric, variants of these 
transcription factors support robust p300 HAT activation. We demon- 
strate that transcription factor dimerization enables p300 trans- 
autoacetylation in a lysine-rich, intrinsically disordered autoinhibitory 
loop (AIL) in the HAT domain that serves as a ‘pseudosubstrate’ and 
is important for regulating the HAT activity of p300'®. A crystal struc- 
ture of the core domain of p300 provides a snapshot of a potential 
trans-autoacetylation reaction intermediate in which the AIL projects 
into the active site of a neighbouring p300 molecule. As HAT activation 
is closely linked to transcription factor activation, these results causally 
relate cellular signalling to the activation and DNA targeting of a chro- 
matin modifier and provide mechanistic insights into the long-standing 
and general correlation between an active, acetylated chromatin struc- 
ture and gene transcription. 


Transcription factor dimerization activates p300 

To explore whether p300 is activated by signal-dependent IRF3 dimer- 
ization, we produced three recombinant IRF3 species: inactive mono- 
mers (IRF3); active, TBK1-phosphorylated IRF3 dimers (pIRF3); and 
a truncation mutant that lacked the C-terminal autoinhibitory element 
(IRF3 AC) (Extended Data Fig. 1a, b). Truncation of the C-terminal 
autoinhibitory element allows for p300/CBP binding but abolishes 
IRF3 dimerization’®. We confirmed the oligomerization status by 
gel filtration chromatography (Extended Data Fig. 1b), and investi- 
gated the effect of IRF3 activation and oligomerization status on the 
autoacetylation of p300s in the presence of ['*C]acetyl coenzyme A 
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Fig. 1 | Transcription factor dimerization activates p300. a, p300s was 
incubated for the indicated times in the presence or absence of inactive, 
monomeric IRF3 or TBK1-phosphorylated, dimeric pIRF3. Samples 

were analysed by SDS-PAGE followed by Coomassie staining and 
autoradiography. Representative data of three independent experiments 
are shown. b, Quantification of the autoacetylation of p300s. c, p300 is 
activated by TBK1-mediated IRF3 phosphorylation. p300s was incubated 
with recombinant GST-STING, TBK1 and IRF3 in the presence of ATP 
and ['4C]acetyl-CoA. Top, Coomassie-stained SDS-PAGE gel. Middle, 
analysis of IRF3 phosphorylation on $396 using immunoblotting. Bottom, 
autoradiography. Representative data of three independent experiments 
are shown. d, HAT scintillation proximity assay. The degree of histone 
H4 substrate acetylation was quantified using scintillation counting. 

e, As in a but using inactive, monomeric STATIAN or activated, dimeric 
pSTATIAN. Activated, dimeric pSTAT1LANC that lacks the C-terminal 
TAD did not stimulate the autoacetylation of p300s. Samples were 
analysed as in a. Representative data of three independent experiments are 
shown. f, Quantification of the autoacetylation of p300s. Intensity values 
were normalized by dividing by the maximum autoacetylation signal 
obtained after 60 min. Error bars shown in b, d and f: three independent 
experiments were performed, data are mean + s.d. Data analysis and 
plotting was performed with GraphPad Prism 7.0. For gel source data, see 
Supplementary Fig. 1. 
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(acetyl-CoA). p300s is a short p300 construct that spans from the TAZ1 
to the NCBD/IBiD domains and contains a deletion of the flexible N- 
and C-terminal regions (Extended Data Fig. 4a). p300s autoacetylated 
slowly in the absence of IRF3 (Extended Data Fig. 1c). The inclusion 
of inactive, monomeric IRF3 or IRF3AC did not modify HAT activity 
(Fig. 1a, Extended Data Fig. 1c). By contrast, the inclusion of active, 
TBK1-phosphorylated IRF3 dimers (pIRF3) resulted in a rapid burst of 
autoacetylation followed by a gradual increase of acetylated p300s 
(Fig. 1a, b). As IRF3AC did not support p300 HAT activation, IRF3 
dimerization and not solely p300 binding is essential for HAT activation. 

p300 HAT activation was directly dependent on the TBK1-mediated 
phosphorylation of IRF3 on Ser396, a critical residue for IRF3 acti- 
vation and dimerization?*. Only when both TBK1 and IRF3 were 
included in the reaction did we observe p300 activation (Fig. 1c, 
lane 4). We observed only a modest stimulatory effect of the adaptor 
protein STING (Fig. 1c, lanes 7-12), probably owing to the relatively 
large amounts of TBK1, which is already active and phosphorylates 
IRF3 even in the absence of STING”°. We conclude that IRF3 phos- 
phorylation by TBK1 and its dimerization are required for p300 HAT 
activation. 

To analyse the effect of pIRF3 on p300 activation and acetylation 
of the histone substrate, we established a scintillation proximity HAT 
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assay. We incubated saturating amounts of a biotinylated histone H4 
substrate peptide with p300s in the absence or the presence of equimo- 
lar pIRF3 and increasing concentrations of [*H]acetyl-CoA (Fig. 1d). 
pIRF3 stimulated p300 histone-substrate acetylation, as determined by 
the increased rate of H4 acetylation obtained in the presence of pIRF3 
(Vinax = 43.8 + 5.3 cpm min™! as compared to Vinax = 22.5 +2.8 cpm 
min! in the absence of pIRF3). These data indicate that pIRF3 not 
only stimulates p300 autoacetylation and activation, but also stimulates 
more efficient acetylation of the histone substrate. 

We also investigated the effect of STAT1 on p300 activation. STATs 
are activated in response to cytokine receptor engagement and Janus 
kinase activation”!. JAK-mediated phosphorylation of STAT1 on 
Tyr701 induces dimerization and translocation to the nucleus, where 
STAT1 binds to DNA elements to regulate gene expression. STAT 1 
contains a C-terminal transactivation domain (TAD) through which it 
interacts with p300/ CBP’. A naturally occurring splice variant, STAT 18, 
lacks the TAD and acts in a dominant negative manner~. Structures 
of the active, STAT 1 Tyr701-phosphorylated dimer bound to DNA, as 
well as the STAT1 TAD bound to the TAZ2 domain of CBP, have been 
determined previously***. 

To understand the effect of STAT1 activation and oligomeriza- 
tion status on p300 activity, we produced STAT1AN that lacked the 
N-domain and STAT1ANC lacking the N-domain and the TAD as 
non-phosphorylated monomers or as Tyr701-phosphorylated dimers 
(Extended Data Fig. le-h). We found that p300s autoacetylated slowly 
in the absence of STAT], and that the addition of non-phosphorylated, 
monomeric STAT1AN did not stimulate p300s autoacetylation beyond 
background levels (Fig. le, f). By contrast, the addition of Tyr701- 
phosphorylated STATIAN (pSTAT1AN) dimers to p300s resulted 
in a rapid increase of p300 autoacetylation. Activation required the 
C-terminal TAD of STAT1, as the addition of a Tyr701-phosphorylated 
STAT1 dimer (pSTAT1 ANC) that lacked the TAD did not stimulate 
p300 autoacetylation (Fig. le, f). 

STATI dimerization, and not solely interaction with the TAZ2 
domain, is required for the activation of p300. Unphosphorylated, mon- 
omeric STAT1AN, which contains the TAD and is able to interact with 
the TAZ2 domain of CBP, did not stimulate p300 activity. However, 
stimulation with STAT1 was not as potent compared with that of 
IRF3, possibly because our STAT1 preparation is unphosphorylated 
on Ser727, which is required for maximal gene activation”. Together, 
our data are consistent with a model in which the AIL peptide serves as 
an intramolecular ‘pseudosubstrate’ and a competitive HAT inhibitor’®. 
Dimeric ligands such as pIRF3 and pSTAT1 allow p300 activation by 
bringing two molecules together to enable trans-autoacetylation of the 
AIL, which in turn relieves autoinhibition and enables more efficient 
entry of substrates into the HAT active site. 


Structure of p300 adopts an AIL-swap conformation 
To further understand the role of the AIL in the regulation of these 
structural transitions, we crystallized the hypoacetylated form of 
the catalytic core of p300 comprising the bromo-RING-PHD-HAT 
domains (BRP-HAT) that contained the AIL. Crystals were obtained 
using a similar protocol as published previously’®. Crystals diffracted 
to a minimal Bragg spacing of 3.1 A and we determined the structure 
by molecular replacement (Extended Data Table 1). The crystal form 
contained four p300 molecules in the asymmetric unit (Extended Data 
Fig. 2). Comparison with our previous structure”® showed that the bromo- 
PHD-HAT domains overlay well on each other with a root-mean- 
square deviation (r.m.s.d.) of approximately 1 A. However, the RING 
domains were not visible in the initial electron density map. Anomalous 
difference density maps showed a density peak for the zinc atom of the 
RING domain, but it was not at the expected location. Manual reposi- 
tioning enabled the correct placement of the RING domains into the 
new position and the refinement of the structure (Fig. 2a, Extended 
Data Fig. 3). 

The p300 molecules show an antiparallel arrangement of the BRP-HAT 
domains (Extended Data Fig. 2a). In this configuration, the HAT 
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Fig. 2 | The structure of p300 adopts a AIL-swap conformation. 

a, Monomer I is surface-rendered and monomer II is shown as a cartoon. 
The AIL loop from monomer II is shown in yellow. The AIL lies near the 
HAT substrate-binding groove of monomer I. A disordered segment of the 
AIL is shown as a dotted line. b, Close-up view of the residues of the AIL 
loop from monomer II and residues of monomer I in the substrate-binding 
pocket. c, Binding of the positively charged AIL is mediated by interactions 
with negatively charged residues in the substrate-binding pocket of the 
HAT. 


domains from two neighbouring molecules are closely apposed 
(Fig. 2a). In all protomers, AIL residues 1520-1532 adopt a helical 
extension of «6 which packs against the outwardly rotated RING 
domain of the neighbouring protomer (Fig. 2a). In monomer II, res- 
idues 1566-1581 extend away from the HAT domain and associate 
with the substrate-binding pocket of the HAT domain in monomer 
I, approximately 17 A away from the lysine substrate binding tunnel 
(Fig. 2b). The remainder of the AIL (residues 1532-1564) is disordered. 
In this conformation, positively charged residues K1568, K1569 and 
K1570 project towards the highly electronegative substrate-binding 
pocket of the HAT domain in monomer I (Fig. 2c). Analysis by 
size-exclusion chromatography—multi-angle laser-light scattering 
(SEC-MALS) revealed that p300 is monomeric at low micromolar 
concentrations (see Extended Data Fig. 6), which suggests that the AIL 
loop-swapped interactions do not appear to mediate the formation of 
stable dimers, but may instead constitute more transient self-associations. 
Although the AIL is clearly flexible and the electron density over the 
exchanged region is not visible in all protomers (Fig. 2b, c, Extended 
Data Fig. 2b, c), this arrangement supports the interpretation that, at 
high concentrations and when in close proximity to each other, two 
p300 monomers can engage each other through an AIL loop-swap. 


Structural rearrangement of the RING domain 

We previously proposed that active-site restriction by the RING 
domain is a negative regulatory mechanism for HAT activity*®. A 
restricted active site is predicted to reduce the probability of substrates 
engaging with the active site by random diffusion, and could thus 
be important in enabling the regulation of acetylation by substrate 
recruitment. Consistent with this model, mutations that map to the 
structural framework that holds the RING domain in place result in 
HAT activation in cells*®. In our current structure the RING domain 
rotates by around 39° away from the HAT active site, which results in 
an overall displacement by around 22 A as compared to the previously 
determined structure that lacks the AIL (Fig. 3a). The axis of rotation 
is located perpendicular to the flexible loops L1 and L2 that connect 
the RING to the PHD domain. 
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Fig. 3 | Structural rearrangement of the RING domain. a, The RING 
domain (green) rotates by approximately 39°, resulting in a displacement 
of about 22 A away from the active site. The rotation axis is indicated as a 
grey rod. b, In the loop-swap conformation, residues in the RING-HAT 
interface are disrupted resulting in a more open HAT active site. Leul182 
is positioned approximately 15 A away from the Lys-CoA inhibitor in the 
loop-swap conformation (green) but within 5.5 A in the absence of the 
loop swap (magenta). c, Repositioning of the RING domain enables the 
AIL from monomer II to approach the active site of HAT on monomer I. 
d, Details of the interaction surface of the AIL from monomer II with the 
RING domain of monomer I. 


The inward-rotated conformation (magenta in Fig. 3a) is stabilized 
by interactions between Glu1242 of the RING domain and Arg1645 
and Arg1646 of helix a9 of the HAT domain. In addition, Gln1173, 
Thr1174 and Thr1184 of the RING domain pack against the unusu- 
ally long loop (L1) in the HAT domain that covers the CoA portion 
of the Lys-CoA inhibitor. As a result, Leul182 resides within about 
5.5 A of the lysine moiety of Lys-CoA (Fig. 3b). This inward confor- 
mation of the RING domain thus restricts substrate access to the HAT 
domain: the incoming AIL from the neighbouring p300 monomer 
II would clash with the RING domain in the inward conformation 
(Fig. 3c). 

In the outward-rotated conformation, the interactions that attach 
the RING domain to the HAT domain are mostly disrupted (Fig. 3b). 
Leu1 182 is positioned around 15 A away from the substrate-binding 
site and the RING domain is cradled by the AIL extension of helix a6 
of the neighbouring p300 molecule (monomer II residues 1524-1533; 
Fig. 3d). Despite shape complementarity, with a small buried surface 
area of about 320 A2, the interface is predominantly polar, which is 
uncharacteristic of a typical protein-protein interface. However, this 
interaction could help to stabilize an outward-rotated conformation of 
the RING domain and a more open active site of the HAT, apparently 
to enable access of the AIL and trans-acetylation. 


Regulation of HAT activity by flanking domains 

To systematically analyse the flanking domains, we generated a series 
of p300 constructs (Extended Data Fig. 4a) and analysed the effect on 
HAT activity in vitro and in cells. Overexpression of p300 generally 
resulted in hyperacetylated, active p300 variants (Extended Data 
Fig. 4b, c), which probably masks the functional role of structural 
elements potentially involved in autoinhibition of deacetylated p300. 
Deletion of the RING domain did not considerably alter autoacetyla- 
tion or histone acetylation (Extended Data Fig. 5a). This deletion did 
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Fig. 4 | Regulation of HAT activity by flanking domains. a, Top, 
indicated variants of p300 were transiently co-transfected with p53 in COS 
cells and samples were analysed by western blotting using the indicated 
antibodies. Bottom, quantification of the p300(K1499ac) signal. WT, wild 
type. b, Top, analysis of p53 acetylation. Bottom, quantification of the p53 
acetylation signal. Representative data of three independent experiments 
are shown. For details of the mutants, see Extended Data Fig. 4a. Arg and 
Glu indicate proteins in which lysine amino acids in the AIL segment 
spanning amino acids 1546-1570 were mutated to arginine or glutamate, 
respectively. c, Top, H1299 cells were transfected with the indicated 
constructs and analysed by immunofluorescence using anti-HA for p300 
(green); cell nuclei were stained with Hoechst (blue). Bottom, cells were 
treated with the A-485 HAT or the CBP30 bromodomain inhibitor. The 
percentage of cells showing the indicated phenotype (= 200 cells) is 
indicated below each panel. Scale bars, 10 um. For gel source data, see 
Supplementary Fig. 1. 


not adversely affect the structural integrity of p300, as shown by a crys- 
tal structure of the BARP module containing this deletion (Extended 
Data Fig. 5c). 

Deletion of the AIL (AAIL) in all constructs resulted in decreased 
histone acetylation, but bromodomain deletion (ABd) did not affect 
HAT function (Extended Data Fig. 5a, b). Together, our results are 
consistent with previous observations of CBP that—at least in the 
active, hyperacetylated state of the enzyme—RING deletion does 
not substantially affect HAT activity and that the p300 AIL positively 
contributes to substrate acetylation”’”. We next introduced mutations 
into full-length p300 and monitored their effect on p300 autoacetyl- 
ation and p53 acetylation upon transient co-overexpression in cells. 
Deletion of the RING (AR) and CH3 domains resulted in markedly 
increased p300 autoacetylation and p53 acetylation, but deletion of 
the bromodomain or AIL had no major effect (Extended Data Fig. 5e). 
As expected, introduction of the catalytic mutants D1399Y or Y1467F 
abolished p300 autoacetylation or p53 acetylation (Extended Data 
Fig. 5e). Immunofluorescence analysis showed that wild-type p300 as 
well as a ABd and AAIL deletion were uniformly distributed in the 
nucleus, but that the HAT-activating p300 variants AR and ACH3 
formed nuclear foci that co-localized with p53 (Extended Data Fig. 5d). 
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To validate these results, we analysed and confirmed the phenotype 
of p300 mutants and p53 acetylation in another cell line (Fig. 4a, b). 
In addition, we analysed p300 variants in which eleven lysine amino 
acids (spanning amino acids 1546-1570 of the AIL) were mutated to 
arginine or glutamate, and found reduced or slightly increased p300 
autoacetylation or p53 acetylation levels, respectively (Fig. 4a, b). 

As we observed the formation of nuclear foci only with HAT- 
activating variants, we proposed that hyperacetylation drives p300 to 
form biomolecular condensates in cells. Accordingly, the introduction 
of a HAT-inactivating D1399Y mutation into p300 ARING, treatment 
with the p300 HAT inhibitor A-4857%, or treatment with the p300/ 
CBP bromodomain inhibitor CBP30 greatly reduced foci formation 
(Fig. 4c). We therefore conclude that HAT activation drives biomo- 
lecular condensation of p300 in cells, apparently through substrate 
engagement by the bromodomain. 


Regulation of p300 by the AIL and RING domain 

We next sought to understand how the highly conserved and intrinsi- 
cally disordered AIL segment contributes to the regulation of the cata- 
lytic function of p300. The AIL spans amino acid residues 1532-1567 
and is positively charged in the deacetylated state, with an estimated 
isoelectric point (pI) of 10.9, and net charge of +-7 at neutral pH. By 
contrast, upon autoacetylation of residues spanning Lys1542-15607°, 
we estimate a pl of 3.5 and a net charge of —2. As the proximal sub- 
strate-binding groove of p300 is largely acidic (Fig. 2c), we proposed— 
consistent with earlier predictions*°—that a deacetylated AIL would 
engage the substrate binding site through electrostatic interactions, 
presumably to prevent access of positively charged lysine-containing 
substrates. Given the disordered nature of the AIL, this proposed inter- 
action is expected to be highly dynamic”’. 

We tested this hypothesis through all-atom Monte Carlo simula- 
tions*. To make this approach tractable, our simulations held the back- 
bone dihedral angles associated with the folded domains fixed, but all 
other degrees of freedom, including all backbone and side chain dihe- 
dral angles in the AIL, were fully sampled. As a result, these simulations 
should be seen to assess how the AIL interacts with the remainder of 
p300 given the observed crystal structure. Simulations were performed 
on the AIL in the deacetylated and acetylated states in the context of 
the p300 monomer. These simulations enabled us to investigate how 
acetylation influenced the conformation and intramolecular interac- 
tions of the AIL. 

Simulations of the deacetylated AIL revealed the presence of exten- 
sive yet highly degenerate electrostatic interactions between the AIL 
and the RING domain and between the AIL and the HAT substrate- 
binding site. These interactions were quantifiable in terms of the 
normalized distances between pairs of amino acid residues (Fig. 5a, 
Supplementary Video 1). Lysine residues in the AIL dynamically asso- 
ciate through long-range electrostatic interactions with acidic residues 
(E1334, E1442, E1505, D1622, D1625 and D1628) in the p300 HAT 
substrate-binding pocket (Fig. 5c). The importance of these residues 
for substrate acetylation has been shown previously”, and nuclear 
magnetic resonance data for CBP confirm that the AIL is intrinsically 
disordered in the deacetylated state””. 

By contrast, in the acetylated state we found no interactions between 
the AIL and the substrate binding site (Fig. 5b and Supplementary 
Video 2). The acetylated AIL essentially behaved like a self-avoiding 
random coil without any strong biases for interaction with itself or with 
the surrounding folded domains, including the bromodomain. It has 
been proposed that the AIL of CBP, when acetylated on K1596 (K1558 
in p300), engages the bromodomain intramolecularly, thus competing 
with histone binding and negatively regulating substrate acetylation”. 
Isothermal calorimetry experiments showed the highest binding 
affinity for multiacetylated peptides, including the diacetylated histone 
peptides H3(K14ac/K18ac) and H4(K12ac/K16ac), generally following 
the pattern KacNNNKac (Extended Data Table 2). Monoacetylated 
peptides typically had weaker binding affinity. A crystal structure of 
the H4(K12ac/K16ac) peptide bound to BARP (Extended Data Fig. 5c) 
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avoiding random coil. Electrostatic interaction mediated by conserved 
lysine residues between K1542 and K1560 of the AIL and aspartic acid or 
glutamic acid residues around the active site of the HAT domain (E1334, 
E1351, E1442, D1444, E1505, D1622, D1625 and D1628). The extensive 
contacts between the AIL and the RING domain originate in part from the 
proximity of the RING domain to the AIL in its inhibitory conformation. 
b, Normalized distances between the AIL and all residues in the active 
(acetylated) monomer. After acetylation, lysine-mediated electrostatic 
interactions are lost. c, Representative conformations with the AIL shown 
as an ensemble for the inactive deacetylated monomer (left) and the active 
acetylated monomer (right). The C, atoms of residues in the AIL are 
coloured according to charge: blue, positive; red, negative; green, non- 
charged. The HAT substrate-binding groove is more exposed in the active 
acetylated state, due to both the relative position of the RING domain 

and the lack of preferential interactions by the AIL. d, Intermolecular 
interactions in the loop-swapped dimer between the AIL of one HAT and 
the adjacent subunit of the other. The adjacent subunit is either in the 
active (top) or inactive (bottom) conformation. In the active state, the AIL 
is able to directly engage with residues E1442 and E1444 from the adjacent 
HAT substrate-binding groove, which suggests that certain orientations of 
the RING domain can sterically hinder access to the AIL. 


confirmed the acetyllysine-specific binding mode. However, a AIL 
peptide acetylated on the three lysines K1549, K1558 and K1560— 
corresponding to some of the most highly acetylated residues in the 
AIL”°—failed to bind to the BRP module. Thus our interpretation is 
that the multiacetylated AIL is not a substrate for the bromodomain, 
presumably because of suboptimal spacing or sequence environment 
of the acetylated lysine sites of the AIL. 

To understand how the RING domain influences the ability of sub- 
strates—including the AIL—to enter the active site of an adjacent p300 
molecule, we performed simulations of the AIL in the context of the 
loop-swapped dimer, using a harmonic potential to maintain the AIL 
in the active site in order to assess potential intermolecular interactions 
(Fig. 5d, Extended Data Fig. 6a). In the active RING conformation, the 
AIL is able to engage the substrate binding site. However, in the inactive 
conformation, the frequency of contacts between the AIL and the acidic 
active site residues E1442 and D1444—residues proximal to the lysine 
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substrate binding tunnel—was reduced by 70-75% (Fig. 5d). Thus, in 
the inactive conformation, the RING domain at least partially reduces 
catalytic activity by limiting accessibility of the active site to the AIL 
and other substrates. 

One prediction from our models is that the deacetylated form of p300 
adopts a more compact conformation, owing to dynamic engagement 
of the AIL with the HAT substrate-binding site, whereas the acetylated 
form adopts a more ‘oper’ conformation (Fig. 5d). To test this possibility, 
we produced hypo- and hyperacetylated p300 variants (Extended 
Data Fig. 6e-g) and analysed the preparations by SEC-MALLS. All 
preparations were monomeric at the concentration tested (2 mg ml!) 
(Extended Data Fig. 6b-d, Extended Data Table 3). Hyperacetylation 
of p300 BRP-HAT resulted in a small decrease in the elution volume, 
which is indicative of a larger hydrodynamic radius (Extended Data 
Fig. 6b). A similar result was obtained upon comparison of hyper- and 
hypoacetylated BRP-HAT-CH3 (Extended Data Fig. 6c). By contrast, 
a variant that lacks the AIL showed no change in the elution volume 
upon hyperacetylation (Extended Data Fig. 6c). Our data are therefore 
consistent with the model that the catalytic p300 ‘core’ adopts a com- 
pact conformation in the hypoacetylated state, with autoacetylation 
resulting in a more extended conformation. 


Discussion 

Our findings provide detailed mechanistic insights into how cellular 
signalling controls the activity of a chromatin regulator. We propose 
a multi-step process for p300 HAT activation and signal transmis- 
sion to chromatin (Extended Data Fig. 7a—d). In the basal state, the 
deacetylated AIL is expected to maintain an overall positively charged 
environment in close proximity to the active site of the enzyme, thus 
preventing access of positively charged lysine-rich substrates. Direct 
access to the CoA-binding tunnel and autoacetylation of the AIL in cis 
appears to be prohibited, in part due to the positioning of the RING 
domain (Fig. 5d). 

Cellular signalling initiates phosphorylation of transcription fac- 
tors, such as IRF3 or STAT 1, which results in their activation and 
dimerization. The activated, dimeric transcription factors are in their 
DNA-binding-competent conformation and can engage two molecules 
of p300 in the nucleus, thus increasing the likelihood of AIL disen- 
gagement from its inhibitory position in cis and of its capture in trans 
by a second p300 molecule. Association of two p300 molecules does 
not necessarily require precise stereospecific interactions between the 
structured domains, because acetylation at several lysines in the AIL 
indicates a series of possible conformations in such transiently asso- 
ciating dimers. We predict that regulated oligomerization uncouples 
recruitment from HAT activation, which could explain why not all 
p300/CBP recruitment events result in chromatin acetylation and gene 
activation !?-173, 

It has been proposed that enhancer RNA interacts with the AIL to 
regulate CBP HAT activity**. We have attempted to reproduce these 
results using KIf6, one of the most potent enhancer RNAs reported**. 
We could not detect p300 HAT activation using up to equimolar 
amounts of KIf6 (Extended Data Fig. 7e, g). We note that, in a pre- 
vious study, CBP was purified in buffer containing EDTA; this is 
detrimental to the structure of p300/CBP owing to the presence of 
multiple zinc-binding domains**. When unfolded by incubation with 
EDTA, CBP and p300 have a high tendency to aggregate and to form 
non-specific interactions*®. Paradoxically, as the HAT domain is not 
affected, inclusion of EDTA can have an ‘activating’ effect in biochem- 
ical assays, apparently due to such non-specific aggregation (Extended 
Data Fig. 7f). The detrimental effects of EDTA on the structure and 
function of p300/CBP need to be taken into account in the interpre- 
tation of such data. 

The ability of certain histone-modifying enzymes to bind to the 
post-translational modification (PTM) they generate has led to models 
in which such enzymes might propagate modified chromatin 
domains by a positive-feedback loop**. According to this view his- 
tone PTMs and other chromatin modifications form an additional, 
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DNA-sequence-independent layer of the genome, which is read out 
by enzymes that recognize these modifications to ‘epigenetically’ reg- 
ulate genomic function*®. An alternative view proposes that histone 
PTMs ultimately depend on DNA-sequence-dependent recruitment of 
chromatin modifiers, and so do not necessarily form an independ- 
ent ‘epigenetic’ layer of the genome®*”-*’. The controversy has arisen 
because it has been difficult to disentangle, for most chromatin regu- 
lators, the relative contributions of DNA targeting and histone PTM 
substrate engagement to the overall chromatin-modification reaction. 

We show that regulation of p300 is linked to the activation and 
oligomerization status of transcription factor ligands, and therefore 
conclude that specificity for p300-mediated chromatin acetylation 
arises mainly through transcription-factor-mediated and DNA- 
sequence-dependent genome targeting. The next question is how the 
bromodomain contributes to p300 function. Although it is clear that 
the bromodomain can engage acetylated histone peptides and bind to 
hyperacetylated chromatin”, deletion or mutation of the bromo- 
domain has no apparent effect on substrate acetylation”, has only 
minimal effects in a haematopoiesis model system”, and bromodo- 
main inhibition does not adversely affect genome targeting of CBP*?. 

We favour a model in which DNA binding provides the lead anchor- 
ing mechanism: local hyperacetylation increases the binding valency 
by enabling bromodomain substrate engagement, which further helps 
to compartmentalize the biochemical reaction and contributes to sig- 
nal maintenance“®. p300 HAT-activating mutants form biomolecular 
condensates in cells when transiently overexpressed (Fig. 4c, Extended 
Data Fig. 5d). Treatment with a HAT inhibitor or bromodomain inhib- 
itor greatly reduces the formation of condensates, which indicates that 
hyperacetylation and bromodomain-substrate engagement are critical 
in driving assembly. The formation of condensates, possibly through 
phase-separation, may provide a mechanism to enable signal integra- 
tion on enhancers and transcriptional control. It will be critical to 
disentangle cause-effect relationships of DNA targeting, chromatin 
modification and histone PTM substrate engagement of other chro- 
matin regulators*-*”, 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Constructs. For cell-free protein expression, CDNA of p300 (NCBI reference 
sequence: NM_001429.3) variants were cloned into the pIlVEX2.4d vector (Roche) 
with a N-terminal 6 x His tag and a C-terminal Flag tag. In the AR constructs, the 
RING domain encompassing residues 1169-1241 was replaced by a glycine amino 
acid residue linker. In the AAIL constructs, loop amino acid residues comprising 
residues 1520-1581 were replaced by the flexible linker sequence SGGSG. For 
Escherichia coli expression, cDNA encoding residues 1048-1282, for the BRP or 
BARP were cloned into the vector pETM-33 (EMBL) with a tobacco etch virus 
(TEV)-cleavable N-terminal glutathione S-transferase (GST) tag. p300 BRP_HAT 
variants were cloned into pFASTBACI1 (Thermo Fisher) and expressed in insect 
cells as shown previously”®. p300s constructs, spanning amino acid residues 
324-2094, were cloned into pFASTBACI1 vector with an N-terminal Flag tag. 
Haemagglutinin (HA)-tagged full-length p300 variants were cloned into pcDNA3.1 
(Thermo Fisher). Point mutations were introduced by QuikChange mutagenesis 
(Agilent). Point mutations and nucleotide deletions carried out in p300 full-length 
(1-2414) or p300s (324-2094) were done through transfer vectors as described 
previously”®. STAT1AN (136-748), STATIANC (136-713) and IRF3AC (1-382) 
with a C-terminal intein tag were cloned into the pTXB1 vector (New England 
Biolabs) using the restriction enzymes Ndel (STAT1) or NcoI (IRF3) and Spel. 
IRF3 (1-427) with an N-terminal His-tag cleavable by TEV protease was cloned 
using the restriction enzymes Ncol and Xhol into the vector pETM-11 (EMBL). 
All constructs were confirmed by DNA sequencing. 

Expression and purification. Expression and purification of Flag-tagged p300s 
constructs was done as described previously”. This method enables purification 
of p300s variants that are already pre-acetylated. Expression and purification of 
p300 BRP_HAT and SIRT2 were performed as described previously”*. TBK1 was 
expressed in insect cells and purified as described previously”. Cell-free protein 
synthesis was done in a 501] reaction volume. In brief, 101g ml! of His-p300 
variants in pIVEX2.4d were added to a reaction mixture containing 1 mM amino 
acid mix, 0.8 mM rNTPs (guanosine-, uracil-, and cytidine-5/-triphosphate ribo- 
nucleotides), 1.2 mM adenosine-5’-triphosphate, 55 mM HEPES, pH 7.5, 68 uM 
folinic acid, 0.64 mM cyclic adenosine monophosphate, 3.4 mM dithiothreitol 
(DTT), 27.5 mM ammonium acetate, 2 mM spermidine, 541M ZnCl,, 80 mM 
creatine phosphate, 208 mM potassium glutamate, 16 mM magnesium acetate, 
250 1g ml! creatine kinase, 27 pg ml"! T7 RNA polymerase, 0.175 1g ml! tRNA 
and 67 11 ml“! $30 E. coli bacterial extract. Incubation was carried out at 22°C 
with agitation for 16 h. Proteins were purified using Ni-NTA chromatography 
(IMAC Sepharose 6 FF, GE healthcare) in buffer 1 (20 mM TRIS, pH 8.0, 300 mM 
NaCl, 1 mM DTT, 5M ZnCl.) containing Complete Protease Inhibitors EDTA- 
Free (Roche). The resin was washed with 20 column volumes of buffer 1 and the 
protein eluted with 5 column volumes of buffer 1 containing 300 mM Imidazole. 
The protein was concentrated in a pre-washed Amicon Ultra 0.5 ml Ultracel 10K 
centrifugal filter (molecular weight cut off 10 kDa; EMD Millipore). The protein 
was buffer-exchanged into buffer 1 using 0.5 ml Zeba Spin desalting columns 
(molecular weight cut off 7 kDa; Thermo Scientific), flash-frozen in liquid nitrogen 
and stored at —80°C. 

For expression of GST-BRP and GST-BARP fusion proteins in E. coli BL21 
(DE3), LB medium enriched with 100,1M ZnCl, was used. Cell pellets were resus- 
pended in buffer 1 containing Complete Protease Inhibitors EDTA-Free (Roche) 
and lysed using a microfluidizer (Microfluidics Corp.). The lysate was clarified 
by centrifugation for 30 min at 39,000g in a JA-25.5 rotor (Beckman) and applied 
to a Glutathione Sepharose 4 Fast Flow resin according to instructions from the 
manufacturer (GE Healthcare). The resin was washed with buffer 1 and incubated 
with His-tagged TEV protease (1:100 w/w) for 14-16 h at 4°C. Subtractive Ni-NTA 
chromatography (IMAC Sepharose 6 FF, GE Healthcare) was then used to remove 
the residual His-tag and TEV protease. The untagged protein was further purified 
by gel filtration on a High Load 16/60 Superdex 75 column (GE Healthcare) equil- 
ibrated in 20 mM HEPES, pH 7.5, 300 mM NaCl, 0.5 mM TCEP and 5M ZnCh. 
The final protein was concentrated to 15 mg ml! in a prewashed Amicon Ultra-15 
Centrifugal filter (molecular weight cut off 10 kDa; EMD Millipore), flash-frozen 
in liquid nitrogen and stored at —80°C. 

The expression and purification of non-phosphorylated STAT1 variants 
(STATIAN, STATIANC) and IRF3AC (1-382) was done in E. coli using the 
IMPACT expression system (New England Biolabs). For the expression of Y701 
phosphorylated variants (pSTATIAN, pSTAT1ANC), proteins were co-ex- 
pressed with Elk receptor tyrosine kinase domain in E. coli BL21(DE3) TKB1 
cells (Agilent). Cells were collected by centrifugation and resuspended in buffer 2 
(20 mM HEPES pH 7.5, 500 mM NaCl). The cells were lysed in a microfluidizer 
(Microfluidics Corp.) and the soluble fraction was obtained by centrifugation for 
30 min at 39,000g in a JA-25.5 rotor (Beckman). The supernatant was first passed 
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over chitin beads (New England Biolabs) and washed with buffer 2 for 10 column 
volumes. The protein was cleaved at 4°C for 16 h in buffer 2 containing 50 mM 
DTT, eluted and further purified by gel filtration on a High Load 16/60 Superdex 
200 column (GE Healthcare) equilibrated in buffer 2. 

GST-STING, comprising the soluble cytoplasmic domain spanning amino 
acids 138-378, was expressed in E. coli BL21(DE3) at 37°C for 3 h. The cells were 
collected by centrifugation and resuspended in buffer 3 (20 mM TRIS, pH 8.0, 
300 mM NaCl, 1 mM DTT) containing Complete Protease Inhibitors EDTA-Free 
(Roche). The cells were lysed in a microfluidiser (Microfluidics Corp.) and the 
soluble fraction was obtained by centrifugation as above. The supernatant was 
passed over equilibrated Glutathione Sepharose 4 Fast Flow resin according to 
instructions by the manufacturer (GE Healthcare). The resin was washed with 
buffer 3 and eluted with 10 mM reduced Glutathione in buffer 3. The protein was 
further purified by gel filtration on a High Load 16/60 Superdex 200 column (GE 
Healthcare) equilibrated in 20 mM HEPES, pH 7.5, 300 mM NaCl, 0.5 mM TCEP. 
The final protein was concentrated to 16 mg ml“! in a prewashed Amicon Ultra-15 
centrifugal filter (molecular weight cut off 30 kDa; EMD Millipore), flash-frozen 
in liquid nitrogen and stored at —80°C. 

IRF3 was expressed in E. coli BL21(DE3) at 18°C for 16 h. The cells were col- 

lected by centrifugation and resuspended in buffer 2 containing 10 mM imida- 
zole. The cells were lysed in a microfluidiser (Microfluidics Corp.) and the soluble 
fraction was obtained by centrifugation as above. The supernatant was passed over 
Ni?*+-conjugated IMAC Sepharose resin (GE Healthcare) and washed with buffer 
2 containing 20 mM imidazole. The protein was eluted in buffer 2 containing 
500 mM imidazole and was further purified by gel filtration on a High Load 16/60 
Superdex 200 column in buffer 2 containing 0.5 mM TCEP. IRF3 was phospho- 
rylated in vitro at a 1:10 molar ratio TBK1:IRF3 (1 mg ml) in presence of 5 mM 
MgCl, and 1 mM ATP. The reaction was incubated at 30°C for 1 h and then for 
an additional 10 h at 21°C. Phosphorylated IRF3 was further purified by size- 
exclusion chromatography on a Superdex $200 16/60 column (GE Healthcare) in 
20 mM HEPES, pH 7.5, 300 mM NaCl, 0.5 mM TCEP. The production of recom- 
binant histones was done following standard procedures”. 
Crystallization and structure determination. The p300 BRP_HAT construct 
comprising the AIL and the mutation Y1467F was deacetylated as described pre- 
viously”®. The protein at 4.5 mg ml! was incubated with a threefold molar excess 
of the bi-substrate inhibitor Lys-CoA* before crystallization. Crystals in the P2 
space group were grown by hanging-drop vapour diffusion at 4°C by mixing equal 
volumes of protein and crystallization solution containing 100 mM HEPES, pH 7.5, 
18-22% polyethylene glycol 3350 and 0.2 M NaCl. Crystals were cryoprotected in 
20-25% ethylene glycol and drop-frozen in liquid nitrogen. We collected native 
diffraction data to a minimum Bragg spacing of 3.1 A resolution at the ESRF on 
beamline ID29 under a nitrogen gas stream at 100 K, at a wavelength of 1.282 A. 
We processed the data with XDS (Extended Data Table 1). The structure of the 
p300 BRP_HAT was determined by molecular replacement using Phaser. There 
are four copies in the asymmetric unit and the RING domains were initially not 
visible in the electron density map and are partially disordered. Inspection of an 
anomalous difference map indicated peak density for the zinc ions and enabled 
positioning of the RING domain in the outward-rotated conformation. A final 
model was produced by iterative rounds of manual model building in Coot and 
refinement using PHENIX. The final model contains residues 1045 -1664 with a 
deletion of residues 1534-1567 and was refined to a 3.1 A resolution with an Ryork 
and an Rfree of 19% and 26%, respectively (Extended Data Table 1). Analysis of the 
refined structure by MolProbity showed that there are no residues in disallowed 
regions of the Ramachandran plot. The MolProbity all-atom-clash score was 1.91, 
placing the structure in the 100th percentile among structures refined at 3.1 A 
resolution (n = 2,108). 

The BARP construct at 15 mg ml~! was mixed with 2 mM ofa 11-mer his- 
tone peptide H4 (10-20) GLGKacGGA KacRHR (only the underlined amino acid 
sequence is visible in the electron density map) containing two acetylated lysine 
residues at K12 and K16, H4(K12ac/K16ac). Crystals in the P2)2)2) space group 
were grown by hanging-drop vapour diffusion at 21°C by mixing equal volumes 
of protein and crystallization solution containing 1.6 M ammonium sulfate and 
100 mM bicine at pH 9.0. Crystals were cryoprotected in 20% ethylene glycol and 
drop-frozen in liquid nitrogen. We collected native diffraction data to a minimum 
Bragg spacing of 2.5 A resolution at the ESRF on beamline ID29 under a nitro- 
gen gas stream at 100 K, at a wavelength of 1.0 A (Extended Data Table 1). Data 
processing, molecular replacement and refinement were performed as indicated 
above. The final model contains two copies of the BARP module corresponding 
to residues 1049-1279 of p300 in the asymmetric unit. As expected, replacement 
of the RING domain residues 1169-1241 by a single glycine amino acid linker 
did not adversely affect the remainder of the BP module. Analysis of the refined 
structure by MolProbity showed that there are no residues in disallowed regions 
of the Ramachandran plot. The MolProbity all-atom-clash score was 0.97, placing 
the structure in the 100th percentile (n = 6,960). 
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Monte Carlo simulations. All-atom Monte Carlo simulations were performed 
using ABSINTH implicit solvent model and version 2 of the CAMPARI Monte 
Carlo simulation engine (http://campari.sourceforge.net)*". The initial AIL loop 
was constructed using MODELLER, and the complete set of backbone and side 
chain torsional angles were sampled for the AIL for which electron density was 
missing. Simulation analysis was performed with MDTraj and CTraj (http:// 
pappulab.wustl.edu/CTraj.html)””. The backbone degrees of freedom of the folded 
domains were not sampled, while all amino acid side chains were fully sampled. 
CAMPARI simulations explore conformational space through perturbation to 
the torsional angles (as opposed to Cartesian positions, as is typical for molec- 
ular dynamics). Consequently, a fully closed loop represents a major sampling 
challenge. To address this, we severed the covalent backbone bond between the 
N-terminal part of the AIL loop and the folded domain, and replaced this bond 
with a strong harmonic potential that recapitulates the distances constraint asso- 
ciated with the covalent bond. This allows moves to fully rotate the chain and 
markedly improves the efficiency of conformational sampling. 

We generated 5,000 independent non-overlapping starting configurations and 
used a clustering approach to identify the most distinct 200 conformations. These 
were used as the starting conformations for full simulations. We ran 200 independ- 
ent simulations of the deacetylated and acetylated p300 in the monomeric form, 
and 200 independent simulations of the loop-swapped p300 dimer in the active 
and inactive form (800 simulations total). Analysis was performed after an initial 
equilibration. Dimer simulations applied a harmonic potential between residue 
1550 from the AIL and residue 1442 from the other monomer to maintain the AIL 
in the active site. This enabled us to directly compare active-site accessibility of the 
AIL. For monomer simulations, no restraints were applied. 

Each residue on the folded structure was evaluated for contacts with any residue 
in the AIL, and these contacts were summed to give an effective contact score. In 
this manner, the residues on the folded structure that most frequently interacted 
with any residue on the AIL were directly identified. Interaction was primarily elec- 
trostatic in nature, with residues E1334, E1444, E1505, D1622, D1625 and D1628 
engaging in direct interactions. There are also extensive interactions between the 
AIL and the RING domain, although we cannot rule out that these interactions 
are driven by the harmonic potential applied to pull E1442 towards the active site. 
As might be expected, the AIL-RING interactions differed between the active and 
inactive conformations. 

To assess interactions between the AIL and the folded domains in the monomer 

simulations, scaling map analysis was performed. In this analysis, a simulation of 
the AIL as a true self-avoid random coil is performed to generate a reference state, 
and then the mean inter-residue distances obtained in the full simulations are nor- 
malized by the distances obtained from this reference. The self-avoiding random 
coil simulations are performed using an identical protocol to the full simulations, 
with the exception that the only contribution to inter-atomic interactions comes 
from the repulsive part of the Lennard-Jones potential, meaning no attractive inter- 
atomic interactions or solvation effects are experienced. This ensures we generate a 
sequence and structure-specific self-avoiding random coil ensemble that provides 
a true reference state. Extensive details on the technical aspects associated with the 
generation of this reference state have been described previously”. The scaling 
maps enable us to easily identify local regions that engage in interactions that cause 
deviations from self-avoiding random coil behaviour. 
HAT assays. The standard autoacetylation HAT assay was performed using ['4C] 
acetyl-CoA (Perkin-Elmer). Autoacetylation of p300 was quantified by autoradi- 
ography after SDS-PAGE gel analysis. The p300s preparations were equilibrated 
in 1x HAT buffer (25 mM TRIS-HCL, pH 7.5, 100 mM NaCl, 1 mM DTT, 10% 
glycerol and 1x Complete EDTA-free protease inhibitor (Roche)) for 10 min at 
30°C before initiation of the reaction by the addition of 200\.M [‘C]acetyl-CoA 
for the indicated time points. For experiments containing IRF3 STAT! or the 
enhancer RNA (eRNA) KIf6, autoacetylation assays were performed at a fixed 
equimolar concentration (2|1M) of p300s and the indicated transcription factor 
or KIf6. Assays were performed in triplicate with different batches of proteins and 
on different days. At the indicated time point, 5 11 of the reaction was quenched 
by addition of 511 of 2x SDS gel loading buffer followed by analysis on a 4-20% 
SDS-PAGE gel. Experiments shown in Fig. 1c were performed in 1 x kinase buffer 
(20 mM HEPES, pH 7.5, 250 mM NaCl, 20 mM {-glycerol phosphate, 1 mM 
sodium vanadate, 10 mM MgCl, 1 mM DTT, 1 mM ATP and a mix of 2014M 
['4C]acetyl-CoA and 80|M cold acetyl-CoA (A2056, Sigma). 1 1M of p300s was 
incubated in the presence or absence of 11M IRF3, 2).M TBK1 and 11M STING 
(lanes 1-6) or increasing amounts of STING as indicated (lanes 7-12). The gels 
were analysed by western blotting as indicated below or were fixed for 30 min in 
a solution containing 3% glycerol, 10% glacial acetic acid, 20% ethanol (v/v/v) in 
water. The gels were soaked for 5 min in a solution containing 1% glycerol, 5% 
PEG8000 in water and were dried for 30 min using a Bio-Rad Gel Dryer and the 
radioactivity quantified on a phosphorimage analyser (Typhoon, GE Healthcare) 
followed by analysis using ImageJ 1.8.0_112°). 


A p300 HAT scintillation proximity assay was designed similar to that 
described previously”*. In brief, as a substrate we used a synthetic histone H4 
peptide containing 15 amino acids derived from the N terminus of human H4 
that was chemically attached to biotin with an amino hexanoic linker (Biotin- 
C6- GRGKGGKGLGKGGAK) (from peptid.de). The synthetic peptide was 
re-suspended in water and adjusted to pH 7.0 with concentrated NaOH. 

A typical reaction contained p300s (50 nM), 12.5,.M biotinylated H4 peptide, 
acetyl-CoA (0.1,.M to 10M set at around 10 times the apparent K,,) in 20 mM 
Tris-HCI, pH 8.0, 150 mM NaCl, 5 pM ZnCh, 0.01% Tween-20 and 0.1% BSA 
(w/v). For reactions containing pIRF3, 50 nM was added. 201l of a 2x reaction 
mixture containing p300s, H4 peptide with and without pIRF3 was pre-incubated 
at 30°C for 5 min. The reaction was initiated by the addition of 20,11 of 2x 
acetyl-CoA containing a 1:3 mix of tritiated [7H]acetyl-CoA (PerkinElmer; 
NET290050UC) with cold acetyl-CoA. For example, for 10|1M final acetyl- 
CoA concentration, a mix of 541M [*H]acetyl-CoA and 15M cold acetyl-CoA 
(A2056, Sigma) was used. The reaction was quenched at the indicated time points 
by delivering 40 1] of the reaction mix into 1201] of 0.5 M HClin a FlashPlate Plus 
Streptavidin 96-well scintillant-coated microplate (Perkin Elmer, SMP103001PK). 
The plate was incubated for 1 h, and light emission was counted in a MicroBeta2 
Scintillation Counter (Perkin Elmer) at 1 min per well in the top count mode. 
Counts per minute (cpm) were plotted against acetyl-CoA concentration. Typical 
progress curves are shown in Extended Data Fig. 1d. The initial rate was estimated 
by linear regression during the first 10 min of the reaction and plotted against 
acetyl-CoA concentration. All data were analysed using GraphPad Prism 7.0. 

For the results shown in Fig. 4, acetylation reactions were performed in acetyla- 
tion reaction buffer HAT (25 mM Tris-HCl, pH 7.5, 100 mM NaCl, 0.1 mM EDTA, 
1mM DTT, 10% glycerol and 1 x complete EDTA-free protease inhibitor (Roche)) 
with 501M acetyl CoA (Sigma), 100 ng ml" trichostatin A and 21g of purified 
histone octamer. Reactions were incubated for 30 min at 30°C and stopped by 
the addition of 3 x SDS gel loading buffer, then used for Coomassie staining and 
immunoblotting. 

Multi angle laser light scattering-size exclusion chromatography. Before SEC- 
MALLS runs, p300 variants were acetylated and deacetylated using p300 HAT or 
SIRT2 as described previously”°. The reactions were analysed by liquid chroma- 
tography—mass spectrometry as described previously~”. Size-exclusion chroma- 
tography was performed at a flow rate of 0.5 ml min“! on a Superdex 200 Increase 
10/300 GL column equilibrated in SEC-MALLS buffer (20 mM HEPES, 300 mM 
NaCl, 5 pM ZnCh, 0.5 mM TCEP) at 21°C. A 50 il sample of p300 at 2 mg ml"! 
was injected onto the column and multi angle laser light scattering was recorded 
with a laser emitting at 690 nm using a DAWN-EOS detector (Wyatt Technology 
Corp.). The refractive index was measured using a RI2000 detector (Schambeck 
SFD). The molecular weight was calculated from differential refractive index meas- 
urements across the centre of the elution peaks using the Debye model for protein 
using ASTRA software version 6.0.5.3. 

In vitro eRNA transcription. eKlf6 eRNA corresponding to 496 nucleotides of 
the sense strand of human chr13:5802100-5802596 (ref. **) was produced by in 
vitro transcription from a pMA plasmid containing a eKIf6 insert synthesized by 
GeneArt Gene Synthesis (Thermo Fisher). pMA_KIf6 plasmid (501g) was line- 
arized with 80 U of KpnI-HF in a final volume of 100,11 and incubated at 37°C 
for 14-16 h. The in vitro transcription reaction was done in a final volume of 
1 ml, using 1 x T7 buffer, T7 RNA Polymerase and 1 U of RNaseOUT Recombinant 
Ribonuclease Inhibitor (Thermo Fisher). After incubation for 2 h at 37°C, 0.5 U 
of TURBO DNase (2 Up!) (Thermo Fisher) and 1 11M CaCl, was added to the 
reaction and incubated for 30 min at 37°C. Following DNasel treatment, 21] of a 
30 mg ml’ stock of proteinase K powder (Thermo Fisher), dissolved in proteinase 
K buffer (10 mM TRIS pH 7.5, 1 mM CaCl, and 40% glycerol), was added and 
incubated for 45 min at 37°C. Buffer was exchanged into 20 mM HEPES, pH 7.5, 
300 mM NaCl, 0.5 mM TCEP using Amicon Ultra-0.5 ml centrifugal filters 
(molecular weight cut off 3 kDa, EMD Millipore). To further purify the RNA, 
3 volumes of TRIzol (Thermo Fisher) was added to the RNA sample, followed by 
isopropanol precipitation. Purified Kflé RNA was resuspended in 20 mM HEPES, 
pH 7.5, 300 mM NaCl and 0.5mM TCEP. RNA was quantified using a Nanodrop 
spectrophotometer (Thermo Fisher). The quality of K1f6 was assessed by agarose 
gel electrophoresis in 1 x TBE buffer or using denaturing 6M urea 14% PAGE 
(Extended Data Fig. 7c). 

Immunoblotting, immunofluorescence and antibodies. We have used the cell 
lines COS and H1299. They are not on the list of commonly misidentified cell lines 
maintained by the International Cell Line Authentication Committee. COS cells 
were purchased from ATCC, product reference ATCC-CR1-1651, lot no. 4171903. 
H1299 cells were authenticated on December 6th 2016 by LGC Standards: Cell Line 
Authentication Service. Cell lines were authenticated using short tandem repeat 
(STR) analysis as described in 2012 in ANSI Standard (ASN-0002) Human cell lines 
were authenticated using standardization of STR Profiling by the ATCC Standards 
Development Organization as described in ref. **. Upon the receipt of COS cells 
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and after the authentication of H1299 cells, they were expanded. A mycoplasma 
contamination test (MycoAlert Mycoplasma Detection Kit, Lonza cat no. LT07- 
418) was performed and the mycoplasma-free cells were frozen and kept in liquid 
nitrogen. After thawing they were kept in culture for 30 passages with a myco- 
plasma contamination test after 15 passages. For immunoblotting, proteins were 
separated on 4-12% Bis-Tris SDS-PAGE gel (NuPAGE precast gel, Thermo Fisher) 
and transferred onto a nitrocellulose membrane (Hybond C+, GE Healthcare). 
Membranes were blocked with 5% skim milk in PBST buffer (PBS, 0.1% Tween-20) 
and probed with anti-p300(K1499ac) rabbit polyclonal antibody (1:2,500 dilution; 
Cell Signaling, 4771), anti-Kac rabbit polyclonal antibody (1:2,500 dilution; Cell 
Signaling, 9441), anti-Flag mouse monoclonal antibody (1:2,500 dilution; Sigma, 
F1804), anti-HA rabbit polyclonal antibody (1:2,500 dilution, Abcam, ab9110). For 
the detection of STAT1 or IRF3 phosphorylation, the membrane was blocked with 
5% milk in PBST followed by overnight incubation at 4°C with anti-phospho-Stat1 
(Tyr701) rabbit monoclonal antibody (1:2,500 dilution; Cell Signaling no. 9171) 
in PBST buffer containing 5% BSA. For detection of IRF3 S396 phosphorylation, 
we used anti-phospho IRF3 $396 rabbit monoclonal antibody (1:2.500 dilution; 
Cell Signaling no. 4947). Incubations were performed as above. Membranes 
were washed extensively in PBST buffer before and after incubation with anti- 
rabbit or anti-mouse HRP-conjugated secondary antibody (1:10,000 dilution; GE 
Healthcare, NA934 or NA931), and protein bands were visualized on film after 
the ECL reaction (ECL Prime, GE Healthcare). Immunofluorescence was done 
as described previously” or as follows: 24 h post-transfection, cells were treated 
with a DMSO control or with 2.5 ,.M (final concentration) of HAT inhibitor A-485 
(P. Cole, Harvard Medical School) or p300/CBP Bromodomain inhibitor (CBP30, 
Sigma, no. SML1133) dissolved in DMSO. The final DMSO concentration in all 
assays was 0.25%. After 24 h, cells were rinsed once with RNase-free PBS 1X. Next, 
cells were permeabilized in freshly prepared 0.2% Triton (Sigma) buffer for 5 min 
and were then fixed in freshly prepared 4% formalin solution (Sigma) for 10 min at 
room temperature. After three washes in RNase-free 1X PBS at room temperature, 
the cells were incubated in 5% skim milk in 1X PBS for 30 min and then probed 
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with anti-HA high-affinity monoclonal antibody (1:100 dilution; Roche Applied 
Science, cat. no. 11867423001) overnight at 4°C. Cells were washed extensively 
with 1X PBS before and after incubation with Alexa Fluor 488-conjugated second- 
ary antibody (1:500 dilution; Invitrogen, no. A-11006) for 1 h at 37°C. Cells were 
counterstained with Hoechst (250 ng ml!) and examined under a confocal laser 
scanning microscope (LSM510, Zeiss). 

Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability 

Coordinates for the p300 core structure and BARP bound to a diacetylated his- 
tone H4 peptide are available from the Protein Data Bank (PDB) under accession 
numbers 6GYR and 6GYT, respectively. Source data are available for Fig. 1b, fand 
Extended Data Fig. 1d. Figure 1d shows the initial velocities from reactions shown 
in Extended Data Fig. 1d. 
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Extended Data Fig. 1 | The effect of IRF3 or STAT] activation and 
oligomerization on p300 autoacetylation. a, The domain structure of 
IRF3. The truncation construct used is shown at the bottom. b, Size- 
exclusion chromatography of IRF3 variants. Red, unphosphorylated IRF3; 
blue, phosphorylated pIRF3; green, C-terminally truncated IRF3AC. 
Representative data of three independent experiments are shown. c, A 
constant amount of p300s (241M) was incubated alone or in the presence 
of C-terminally truncated IRF3AC (21M) for the indicated time points. 
Samples were analysed by SDS-PAGE followed by Coomassie staining 
and autoradiography. d, Progress curves of HAT scintillation proximity 
assay. Histone H4 substrate acetylation in the presence (green) or absence 
(black) of pIRF3 and varying concentrations of [*H]acetyl-CoA. The 
degree of histone H4 substrate acetylation at different time points and the 
initial velocity (cpm min’) at the indicated acetyl-CoA concentrations 


ARTICLE 


were determined and plotted in Fig. le. Three independent experiments 
were performed and the mean value and error bars representing the 
standard deviation are shown. e, The domain structure of STAT1. The 
truncation constructs used are shown at the bottom, and the Tyr701 
phosphorylation site is indicated. f, Uncropped images of SDS-PAGE 
gels shown in Fig. 1d. The '4C autoacetylation signal of p300s is shown at 
the bottom. g, Size-exclusion chromatography of STAT1 variants. Black, 
STAT1ANC; green, STATIAN; red, Y701-phosphorylated pSTAT1ANC; 
blue, Y701-phosphorylated pSTAT1AN. h, SDS-PAGE analysis of STAT1 
variants and analysis by western blotting. Top, Coomassie staining of 
SDS-PAGE gel; middle, PonceauS staining; bottom, western blot using 
anti-Phospho-Stat1 (Tyr701). Representative data of three independent 
experiments are shown. For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 2 | Crystal packing of the p300 core molecule. 

a, There are four p300 molecules (monomers I-IV) in the asymmetric 
crystallographic unit. The four molecules show an antiparallel 
arrangement of the BRP-HAT domains. As a result, HAT domains from 
monomers | and II are closely apposed. Monomers III and IV engage 
monomer IVgym and monomer IIIym, respectively, of a neighbouring 


RING 


Monomer II 


Cc Monomer II 


2 RMSD 


crystallographic unit, showing that all promoters are in a AIL loop-swap 
conformation. Black arrows indicate the direction of the AIL. The 
disordered segment of the AIL is shown as a black dotted line. 

b, c, Electron density of the AIL. 2F, — F. (b) and F, — F, (c) difference 
density omit maps contoured at 0.8 and 2.0 r.m.s.d., respectively. Coloured 
as in Fig. 3. 
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Extended Data Fig. 3 | Structural analysis of the RING domains. RING domains adopt multiple conformations. b, 2F, — F, (blue mesh) 

a, Superposition of the four p300 molecules (monomers I-IV) in the and anomalous difference Fourier maps (orange mesh) for the four RING 


asymmetric crystallographic unit. Whereas the bromodomains (Bd), PHD —_ domains contoured around 1o and 2.50, respectively. 
and HAT domains superpose with a r.m.s.d. of approximately 0.9 A, the 
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Extended Data Fig. 4 | Regulation of HAT activity by flanking domains. 
a, The domain structure of p300. Sequence conservation of the AIL is 
shown using WebLogo™. The constructs used are shown. b, Analysis of 

in vitro expression of the indicated p300 variants. Purified proteins were 
analysed for autoacetylation by immunoblotting with anti-p300(K1499ac) 


72950 73050 73150 


antibody (left), anti-Flag antibody (middle) and Coomassie staining 
(right). Representative data of three independent experiments are shown. 
c, Representative mass spectrometric analysis of BRP_HAT_ZZ_AAIL 
after in vitro expression (red) and after SIRT2 mediated deacetylation 
(black). 


© 2018 Springer Nature Limited. All rights reserved. 


2 2 
a < < 
k ~ al a es < 
< <x < ia z = £ 
a x, z x, 5, 4, o oO 
oa o oa a 
rd rd rnd & & < & 
a a a =x = = = 
eS$SoSS80S8S8 L688 FSFE GSSGES 
ng sBSRsSRsSRsSR SSR SSR SEER 
Anti-FLAG 
p300 
Anti-p300 
K1499Ac 
Coomassie 


(p300 proteins) 


Anti-H3K56ac 


Coomassie 


Coomassie 


Anti-H2AKac4 


Coomassie 


Anti-H4Kac4 


Coomassie 


12345 67 8 9 101112 13 14151617 18 19 20 21 22 


C 
histone H4 


» Ki6ac 


HA p300 


p53ac 


DNA 
Hoechst 


merge 


Extended Data Fig. 5 | Regulation of HAT activity by flanking domains. 
a, The AIL contributes to histone substrate acetylation of activated 

p300. The details of the constructs used are indicated in Extended Data 
Fig. 4. Defined amounts of p300 variants were incubated with acetyl- 

CoA and the indicated histones before SDS-PAGE analysis, followed by 
Coomassie staining and western blotting with the indicated antibodies. 

b, The indicated amounts of purified p300s variants were incubated 

with histone octamers as in a, followed by SDS-PAGE and immunoblot 
analysis with the indicated antibodies. Anti-Kac, pan-acetyl-lysine 
antibody. Representative data of three independent experiments are 
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shown. ¢, Crystal structure of the H4(K12ac/K16ac) peptide bound to 

the BARP module containing an in-frame RING deletion. Amino acid 
residues 1169-1241 were replaced by a single glycine residue. The deletion 
removes the RING domain (black arrow) and does not adversely affect 

the structural integrity of the BARP module. d, e, Indicated variants 

of p300 were co-expressed with p53 in H1299 cells and analysed by 
immunofluorescence with the indicated antibodies (d) or by western 
blotting (e). Representative data of three independent experiments are 
shown. Scale bars, 101m. 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


a 


RING. AIL Monomer II 


— BRP_HAT_CH3 
— BRP_HAT Ac — BRP_HAT_CH3 Ac 
to 7 BRP_HAT | — BRP_HAT_CH3 AAIL 


0.0 T 


—— BRP_HAT_CH3 AAIL Ac 


Monomer II 


— BRP_HAT_CH3 Ac 
— BRP_HAT_CH3 
— BRP_HAT Ac 


— BRP_HAT 


te+6 


Molecular mass (Da) 


10 " 12 13 


Elution Volume (mL) 


BRP_HAT 
noAc 


73538.7 


BRP_HAT Ac 
11Ac 


2 
a 


Relative Intensity (a.u.) 


73450 


73600 73750 73900 74050 74200 


Deconvoluted Mass (amu) 


74350 74500 


g BRP_HAT_CH3 AAIL 


TAc? 
no Ac 
86363.7 


BRP_HAT_CH3 AAIL Ac 


4Ac 


w 
> 
a 


Relative Intensity (a.u.) 


86350 


86450 86550 86650 86750 


Deconvoluted Mass (amu) 


86850 86950 


Extended Data Fig. 6 | Autoacetylation changes the hydrodynamic 
properties of p300. a, Simulations of the AIL in the context of the loop- 
swapped dimer. Left, cartoon of the trajectory of the AIL (dashed line). 
Right, representative conformations with the AIL C, backbone atoms are 
coloured according to charge. b, SEC-MALLS analysis of deacetylated 
(blue) and acetylated (yellow) p300 core. Note the decrease in elution 
volume upon acetylation. c, SEC-MALLS analysis of deacetylated (blue), 
acetylated (red) BRP_HAT_CH3 and deacetylated (black) and acetylated 
(green) BRP_HAT_CH3 AAIL. There is no increase in elution volume 
upon acetylation of the AAIL construct. d, Comparison of acetylated 
and deacetylated BRP_HAT and BRP_HAT_CH3. The deacetylated 
BRP_HAT (green) and deacetylated BRP_HAT_CH3 (blue) elute at the 
same position, which is indicative of a similar hydrodynamic radius. The 
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acetylated BRP_HAT (yellow) and BRP_HAT_CH3 (red) elute at a larger 
elution volume. The normalized refractive index is plotted as a function 
of elution volume from an $200 column coupled to a MALLS detector. 
Calculated molecular masses are plotted as a function of volume for each 
eluted peak. The experiment was carried out at least three times with 
similar results. One representative example of each sample is shown. 

e, Mass spectrometry analysis (electrospray ionization) of the BRP_HAT 
before (blue) and after (yellow) autoacetylation. The molecular mass 

and the number of acetylation events are indicated. f, Mass spectrometry 
analysis of BRP_HAT_CH3 before (blue) and after (red) autoacetylation. 
g, Mass spectrometry analysis of BRP_HAT_CH3_AAIL before (black) 
and after (green) autoacetylation. 
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Extended Data Fig. 7 | Molecular model and controls showing that 
p300 acetyltransferase activity is not stimulated by eRNA. a, p300 
is maintained in the inactive state by deacetylases such as SIRT2. IRF3 

is autoinhibited by a C-terminal segment in the IAD domain. b, TBK1 
phosphorylation activates and dimerizes IRF3. The activated IRF3 dimer 
engages the IBID domain of p300. c, Recruitment of two molecules of 
p300 results in trans-autoacetylation in the AIL loop and HAT activation. 
d, Activated p300 can acetylate chromatin and engage acetylated 
substrates via the bromodomain. e, A constant amount of p300s (2 1M) 
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was incubated in [’*C]acetyl-CoA alone or in the presence of 21M KIf6 
eRNA for the indicated time points. Samples were analysed by SDS-PAGE 
followed by Coomassie staining (top) and autoradiography (bottom). f, As 
in e but in the presence of 0.5 mM EDTA. The experiment was carried out 
at least twice with consistency. One representative example is shown. 

g, Quality control of KIf6 RNA. 31g K1f6 was deposited on a 1% agarose 
gel or a 14% 6 M urea PAGE gel and detected by SYBR Safe stain. 

M, 100-bp DNA ladder. 
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Extended Data Table 1 | Data collection, phasing and refinement statistics 


Data collection 
Space group 
Cell dimensions 

a, b, c (A) 

a, B,y (°) 
Resolution (A) 
No. reflections 
Rsym or Rimerge 
I/ol 
Completeness (%) 
Redundancy 


Refinement 
Resolution (A) 
Rwork / Fitree 
No. atoms 
Protein 
Lys-CoA ligand 
Zinc 
H4 K12AcK16Ac 
B-factors (mean; A’) 
Proteins 
RING domains 
Lys-CoA ligand 
H4 K12AcK16Ac 
R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


BRP-HAT 
P2, 


100.7, 146.6, 116.3 
90, 91.7, 90 
50-3.10* 

106462 

8.7 (89.7)* 

7.48 (0.7)* 

99.0 (94.0)* 

1.9 (1.9)* 


50-3.1 
0.19 / 0.26 
19370 
19100 
256 

14 


95.0 
164.8 
69.2 


0.013 
1.73 


*Data from one crystal. Values in parentheses are for highest-resolution shell. 
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Extended Data Table 2 | Thermodynamic analysis of the interaction between p300 BRP and histone peptides by ITC 


Protein Peptide (residues Ka (uM N* 
BRP H3 unmodified (1-20) -- No binding 
H3 K4me1 (1-20) -- No binding 
H3 K4me3 (1-20) -- No binding 
H3 K9ac (1-20) -- No binding 
H3 K14ac (1-20) 1761 +356 1.13 
H3 K18ac (1-20) large 1.01 
H3 K9acK14ac (1-20) 578 + 47 1.06 
H3 K14acK18ac (1-20) 104 +27 1.03 
H3 K23ac (11-30) -- No binding 
H3 K18acK23ac (11-30) -- No binding 
H3 S10pho (1-20) -- No binding 
H3 T11pho (1-20 -- No bindin 
BRP H4 unmodified (4-24) -- No binding 
H4 K5ac (4-24) -- No binding 
H4 K8ac (4-24) 58 +12 1.04 
H4 K5ack8ac (4-24) 828 + 10 1.03 
H4 K8acK12ac (4-24) 90+18 1.01 
H4 K12ac (4-24) 71+14 1.05 
H4 K16ac (4-24) -- No binding 
H4 K12K16diac (4-24) 25+5 0.99 
H4 K20ac (1-24) 305 +6 1.00 
H4 K16acK20ac (1-24) 205 + 10 1.238 
H4 K5K8K12K16K20penta-ac (1-24) 38 +4 1.07 
H4 Sipho K5K8K12K16K20penta-ac (1-24) 5442 1.12 
BRP_N1132A H4 K20ac (1-24) -- No binding 
H4 K16acK20ac (1-24) -- No binding 
H4 K5K8K12K16K20penta-ac (1-24) -- No binding 
H4 Sipho K5K8K12K16K20penta-ac (1-24) -- No binding 
BRP AIL (1545-1562) - No binding 
BRP AIL K1549K1558K1560tri-ac (1545-1562) -- No binding 


Mean and s.d. were determined from experiments performed in triplicate. Horizontal lines separate experiments involving different histone peptides or different protein constructs. 

Histone peptide sequences: H3 (1-20) ARTKQTQRKSTGGKAPRKQL, H3 (11-30) T@GKAPRKQLATKASRKSAP, H4 (4-24) GKGGKGLGKGGAKRHRKVLRD. 

AIL (Autoinhibitory loop peptide) SCNAKKKNNKKTSKNKSS (1545-1562). No binding was detected to non-acetylated peptides, or with a construct containing a mutation that abolishes acetyllysine 
binding (N1132A). 

«Binding stoichiometry. 
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Extended Data Table 3 | Summary of SEC-MALLS and mass spectrometry experiments 


Sam p le MMvus MMi Acetylation MMszs 

Da Da level Da (2 mg-mI' 
BRP_HAT 73538 73538 ~O 73380 + 1.5% 
BRP_HAT Acetyl 73918 73538 >8 71690 + 1.6% 
BRP_HAT_CH3 92893 92891 ~O 92810 + 2.0% 
BRP_HAT_CHS Acetyl 93270 92891 >9 90700 + 2.0% 
BRP_HAT_CHS AAIL 86363 86362 ~O 84650 + 2.2% 
BRP_HAT_CHS3 AAIL Acetyl 86446 86362 >2 80450 + 1.7% 


Column labelling is as follows: MMys; molar masses determined by mass spectrometry; MMi», the theoretical molar mass calculated from the appropriate primary sequences, acetylation levels were 

estimated based on the mass differences as compared to the non-acetylated sample; MMsis, molar masses determined by SEC-MALLS at a concentration of 2 mg ml-!. All p300 constructs contained 
the mutation Y1467F. The experiment was carried out at least three times with consistency. Results from one representative example are shown. The mass and errors reported for SEC-MALLS are the 
weight average molar mass and residual standard deviations of the observed data from the fitted values calculated using ASTRA. 
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A two per cent Hubble constant measurement from 
standard sirens within five years 


Hsin-Yu Chen!?*, Maya Fishbach? & Daniel E. Holz?*4 


Gravitational-wave detections provide a novel way to determine the 
Hubble constant!-3, which is the current rate of expansion of the 
Universe. This ‘standard siren’ method, with the absolute distance 
calibration provided by the general theory of relativity, was used to 
measure the Hubble constant using the gravitational-wave detection 
of the binary neutron-star merger, GW170817, by the Laser 
Interferometer Gravitational-Wave Observatory (LIGO) and 
Virgo’, combined with optical identification of the host galaxy™® 
NGC 4993. This independent measurement is of particular interest 
given the discrepancy between the value of the Hubble constant 
determined using type Ia supernovae via the local distance ladder 
(73.24 + 1.74 kilometres per second per megaparsec) and the value 
determined from cosmic microwave background observations 
(67.4+0.5 kilometres per second per megaparsec): these values 
differ”* by about 3c. Local distance ladder observations may achieve 
a precision of one per cent within five years, but at present there are 
no indications that further observations will substantially reduce the 
existing discrepancies?. Here we show that additional gravitational- 
wave detections by LIGO and Virgo can be expected to constrain 
the Hubble constant to a precision of approximately two per cent 
within five years and approximately one per cent within a decade. 
This is because observing gravitational waves from the merger of 
two neutron stars, together with the identification of a host galaxy, 
enables a direct measurement of the Hubble constant independent 
of the systematics associated with other available methods. In 
addition to clarifying the discrepancy between existing low-redshift 
(local ladder) and high-redshift (cosmic microwave background) 
measurements, a precision measurement of the Hubble constant is 
of crucial value in elucidating the nature of dark energy!%!!. 

We explore the expected constraints on the Hubble constant (Ho) from 
gravitational-wave standard sirens. The gravitational-wave data provide 
a direct measurement of the luminosity distance to the source, but the 
redshift must be determined independently. We consider gravitational- 
wave events both with (‘counterpart’) and without (‘statistical’) direct 
electromagnetic measurements of the source redshift, and carry out an 
end-to-end simulation of the Hy measurement from a simulated dataset 
consisting of 30,000 binary neutron star (BNS) mergers and 60,000 binary 
black hole (BBH) mergers. We include realistic measurement uncertain- 
ties, galaxy peculiar velocities and selection effects in our analysis. 

We anticipate that most, if not all, BNS mergers detected in gravita- 
tional waves will have an electromagnetic counterpart (for example, 
from associated isotropic!”!3 kilonova emission®®) that will allow a 
unique host galaxy identification’. Assuming that the BNS population 
is similar to the population of short \-ray bursts, we expect the typical 
offset between a kilonova and its associated host galaxy to be no more 
than’ 100 kpc. Since Advanced LIGO-Virgo BNS detections will be 
within 400 Mpc, it will be possible to identify host galaxies as faint 
as 0.003L, (apparent magnitudes <23), where L, is the B-band 
Schechter function parameter (see Methods), with modest observa- 
tional resources. We find that in this counterpart case, the fractional 
Ho uncertainty will scale roughly as 15% /-/N, where N is the number 


of BNS mergers detected by the LIGO-Hanford, LIGO-Livingston and 
Virgo network (HLV). Throughout, we quote fractional Hp measure- 
ment uncertainties defined as half the width of the symmetric 68% 
credible interval divided by the median. If the Kamioka Gravitational 
Wave Detector (KAGRA) and LIGO-India join the detector network 
(HLVJI), this convergence improves slightly to 13% /-/N,, because a 
five-detector network tends to provide better measurements of the 
source inclination, and therefore distance, owing to the improved polar- 
ization information. 

We note that the representative fractional Hy measurement uncer- 
tainty 7, (15% for the three-detector network, and 13% for the five- 
detector network), is smaller than the typical width of the Hp measurement 
from an individual event (GW170817 provided an unusually tight 
measurement; see Extended Data Fig. 1). This is because for a single 
event, the Ho posterior probability density function is a highly 
non-Gaussian function; the distance-inclination degeneracy leads to 
long tails up to large distances (and low Hp values) for edge-on sources, 
and tails in the opposite direction for face-on sources. Combining these 
asymmetric distributions leads to a1/-/N convergence with a smaller 
effective 0}, than the width of a typical single-event Hj measure- 
ment!>'°, Furthermore, owing to the asymmetry of the single-event 
measurements, it may take about 20 events to reach the expected 1 / a/N 
convergence rate. For example, we may get lucky in the first few events 
and get an unusually good Hp measurement (GW170817 is an excellent 
example of this), after which we will converge more slowly than 1/./N 
for some time as we detect average events. After about 20 events, how- 
ever, we will have a sufficient statistical sample of detections to have 
converged to a representative 0}, for the population. At this point, the 
combined Hp measurement approaches a Gaussian distribution and we 
reach the expected 1/./N behaviour. 

To predict how the Hy measurement improves with time, we consider 
the BNS rates inferred!” from GW170817, 15405. Gpe3 yr, 
together with the planned network sensitivity and duty cycle, to com- 
pute the expected number of detections at each observing stage. 
Figure 1 shows the improvement in the Hp measurement as BNS detec- 
tions with unique host galaxies (and associated redshifts) are accumu- 
lated. We start with a 15% prior measurement on Hp, representing the 
constraint* from GW170817; we approximate this by a Gaussian 
centred at 67.8 km s~! Mpc~' with a standard deviation of 10.2 km s~! 
Mpc, but the exact centre and shape of the Hy posterior do not affect 
our results. The rate of detections will increase as the gravitational- 
wave network improves in sensitivity between LIGO-Virgo's third 
observing run (‘03 HLV’), HLV at design sensitivity (“Design HLV’), 
and the five-detector network (‘Design HLVJY), from an average of five 
BNS detections per year in O3 HLYV, to 32 and 39 detections per year 
for Design HLV and Design HLVJI, respectively. The merger rate 
provides the major source of uncertainty in predicting the Hy measure- 
ment error. The solid line in Fig. 1b shows the average Hp measurement 
error over 100 realizations assuming the median BNS merger rate, while 
the lower/upper bounds of the shaded band assume the upper/lower 
90% bounds, respectively, on the merger rate inferred from GW170817. 
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Fig. 1 | Projected number of BNS detections and corresponding 
fractional error for the standard siren Hy measurement. a, The expected 
total number N of BNS detections for future observing runs, using the 
median merger rate (solid green curve) and upper and lower rate bounds 
(shaded band). b, The corresponding Hp measurement error, defined as half 
of the width of the 68% symmetric credible interval divided by the posterior 
median. The band corresponds to the uncertainty in the merger rate shown 
in a. These measurements assume an optical counterpart, and the associated 
redshift, for all BNS systems detected with gravitational waves. 


We find that, if it is possible to independently measure a unique 
redshift for all BNS events, the fractional uncertainty on Hp will reach 
2% (at the 1a level) by the end of two years of HLV at design sensi- 
tivity (in about 2023; corresponding to about 50 events), sufficient to 
arbitrate the current tension between local and high-z measurements 
of Ho. After about 100 BNS events, gravitational-wave standard sirens 
would provide a 1% determination of Hp. This is expected to happen 
after about two years of operation of the full HLVJI network (around 
2026), but given the rate uncertainties, it could happen many years later, 
or could happen as early as 2023. 

Not all sources will have associated transient electromagnetic coun- 
terparts: we may fail to identify the counterparts to some BNS mergers, 
and counterparts are not expected for BBH mergers. For cases where 
a unique counterpart cannot be identified, it is possible to carry out a 
measurement of the Hubble constant using the statistical approach. 
To do this, the redshifts of all potential host galaxies within the 
gravitational-wave three-dimensional localization region are incor- 
porated, yielding an Hp measurement that is inferior to what can be 
calculated using a counterpart, but is still informative once many detec- 
tions are combined. This means that, in the absence of a counterpart, 
only those gravitational-wave events with small enough localization 
volumes yield informative Hy measurements. If the localization volume 
is too large, it contains a large number of potential host galaxies, which 
will largely wash out the contribution from the correct host galaxy. 
Additionally, it may be difficult in practice to construct a complete gal- 
axy catalogue over a large volume with precise galaxy redshifts. We find 
that for BNSs without counterparts, combining the Hy measurement 
from events that are localized to within 10,000 Mpc? (approximately 
40% of events) yields identical constraints to the combined measure- 
ment using the full sample—events localized to greater than 10,000 
Mpc? do not contribute to the measurement. For this reason, we use 
only the sources localized to within 10,000 Mpc? for the no-counterpart 
projections in Fig. 2. We note that for all of the no-counterpart curves in 
Fig. 2, we start with a flat Hp prior in the range 50-100 km s~! Mpc. 

Because BBH systems tend to have much larger localization volumes 
than BNS systems (as they are more massive and found at greater dis- 
tances), the statistical Hy measurement for BBHs converges very slowly, 
even though they are detected at higher rates. We consider both ‘light 
(components of mass 10M, where Mo is the solar mass; denoted as 
‘10M,-10M,.’) and ‘heavy’ (components of mass 30M; denoted as 
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Fig. 2 | Projected fractional error for the standard siren Hy measurement 
for BNSs and BBHs for future gravitational-wave detector networks. The 
green shaded band corresponds to the BNS rate uncertainty (see Fig. 1); the 
same uncertainty applies to the ‘BNS without counterpart’ curve. For the 
‘without counterpart’ curves, we adopt a statistical standard siren approach 
using only events localized to within 10,000 Mpc’; events with larger 
volumes do not contribute noticeably (see the text). Constraints from BBH 
systems without counterparts are inferior, despite higher rates, owing to the 
larger numbers of potential host galaxies per event. 


‘30M>-30M.q’) BBHs, assuming merger rates of 80*3) Gpc-* yr“! for 
the 10M.-10M. BBHs and 18 Gers Gpc~? yr! for the 30M.-30M5 
BBHs!8 (see Methods). Only about 3% of the light BBHs and about 0.5% 
of the heavy BBHs are localized to within 10,000 Mpc?, which means 
that we expect to detect only 16*}} well localized BBHs by 2026. This 
leads to an approximately 10% Hy measurement with BBHs by 2026. 
We note that the constraints from statistical BBH standard sirens 
improve if the BBH rates are on the high end, as well as if the BBH mass 
function favours low masses. 

For the projections in Fig. 2, we assumed that galaxies are distrib- 
uted uniformly in a comoving volume and that complete catalogues are 
available. If we incorporate the clustering of galaxies due to large-scale 
structure, the convergence rate in the statistical case improves by a 
factor of about 2.5 (see Methods). Incorporating this large-scale struc- 
ture effect, we find that we will still need to detect more than about 50 
BNSs without a counterpart to reach a 6% Hy measurement, compared 
to only ten BNSs or fewer with a counterpart. Meanwhile, accounting 
for galaxy catalogue incompleteness provides an additional source of 
uncertainty (see equation (5) in Methods), which can cancel out some 
of the improvement due to large-scale structure. For example, for a 
galaxy catalogue completeness of 50%, the Hp measurement would be 
degraded by about a factor of two. Therefore, incorporating the effects 
of large-scale structure and catalogue incompleteness, we expect that 
in practice the Ho constraints in the statistical case will be slightly better 
than our prediction in Fig. 2, where the precise factor depends on prop- 
erties of the relevant host galaxies and completeness of the catalogue. 

Besides the with-counterpart and without-counterpart cases, we can 
also anticipate a situation in which we have a counterpart detection but 
no unambiguous host association. For example, an optical counterpart 
could be relatively isolated on the sky without a clearly identified host 
galaxy, or may have multiple possible host galaxies. In this case we can 
pursue a pencil-beam strategy, for example, focusing on the volume 
within 100 kpc of the counterpart (see Methods). For BNSs this will 
reduce the relevant volume to about 10 Mpc’, for which we expect 
to have only about one potential host galaxy or galaxy group, which 
thereby reduces to the with-counterpart case. 

In addition to the BNSs and BBHs discussed here, we can expect merg- 
ers of a neutron star and a black hole!*~*!, and these may have detectable 
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electromagnetic counterparts. Although the rates for these systems 
are uncertain and expected to be low, they will also be seen to greater 
distances than BNS systems, which may render them useful as standard 
sirens”. 

We note that our measurements of distance do not use any astro- 
physical modelling. However, associated electromagnetic observations 
(for example, from short ~\-ray burst afterglows or jet breaks) can provide 
additional constraints on the inclination, and thereby improve the indi- 
vidual measurements*”? of distance, leading to a tighter measurement 
of Hp. In this sense our counterpart results can be considered a con- 
servative estimate. However, one of the advantages of standard sirens 
is that they are ‘pure’ measurements of luminosity distance, avoiding 
complicated astrophysical distance ladders or poorly understood cali- 
bration processes, and instead are calibrated directly by the theory of 
general relativity to cosmological distances. By introducing additional 
constraints based on astronomical observations (for example, inde- 
pendent beaming measurements or estimates of the mass distribution 
or equation of state of neutron stars), there is the potential to introduce 
systematic biases that could fundamentally contaminate the standard 
siren measurements. In the present analysis we do not consider these 
additional constraints, although they may indeed have an important 
part to play in future standard siren science. 

Eventually systematic errors in the amplitude calibration of the detec- 
tors may become a source of concern, because the luminosity distance 
is encoded in the amplitude of the gravitational-wave signal. However, 
the calibration uncertainty is currently limited by the photon calibrator 
to around 1%, and this is likely to improve”*; we look forward to an era 
where sub-1% calibration becomes a necessity, but this is a number 
of years away. Another possible source of distance uncertainty is grav- 
itational lensing. However, at the typical redshift of BNS and BBH 
systems (z< 0.5 at design sensitivity) the effect will be minor relative 
to the uncertainty from the distance-inclination degeneracy”’. In addi- 
tion, for sufficient numbers of sources the effects of lensing will average 
away’. Of course, gravitational-wave cosmology is a new field, and 
unforeseen systematics could certainly arise as we push our measure- 
ments to the 1% level and beyond. 

We stress that our projected Ho constraints are subject to several 
important uncertainties, the largest one of which is the merger rate of 
BNS and BBH systems. The detection rate for BBHs depends sensitively 
on the mass distribution, which is not currently well constrained”*”’, 
Future detections will bring a better understanding of the merger rates 
and mass distributions of compact objects, allowing for improved 
predictions. Nevertheless, it is clear that gravitational-wave standard 
sirens will provide precision constraints on cosmology in the upcoming 
advanced-detector era of gravitational-wave astronomy. 
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METHODS 


We present our method for inferring cosmological parameters from gravitation- 
wave (GW) and electromagnetic (EM) measurements. We first simulate a repre- 
sentative sample of GW detections for a range of detector configurations. We then 
simulate the analysis of these data sets, and explore the resulting standard siren 
constraints. In what follows we highlight important aspects of our calculation, such 
as the role of peculiar velocities and selection effects. 

Synthetic events and host galaxies. Measuring Ho with standard sirens relies on 
our ability to extract the luminosity distance and sky position of GW sources. We 
follow the procedure in ref. ** to localize synthetic BNS merger and BBH merger 
detections. The population of binaries are distributed uniformly in comoving vol- 
ume ina Planck (2015)*° cosmology (2, = 0.308, 24, = 0.692, Hy = 67.8). We 
assume that the BNS merger rate follows the rate measured in ref. '’. To estimate 
the merger rate of 1OM5-10Mj and 30M,-30M. BBHs from the rate measured 
in ref. '8, we assume that the BBH mass function follows a Salpeter power law and 
use 10Mo-10M. BBHs to characterize all BBHs with primary component masses 
between 5M and 15M, and 30M,-30M. BBHs to characterize all BBHs with 
primary component masses between 20M, and 50Mo. We do not place additional 
cuts on the secondary masses, which are distributed uniformly between 5M. and 
the primary mass. 

The detection rate of sources depends on the sensitivity, observing time and 
duty cycle of the GW detector network. We assume that the LIGO- Virgo network 
operates for one year at projected O3 sensitivity, followed by two one-year-long 
observing runs of LIGO-Hanford+ LIGO-Livingston-+ Virgo (HLV) at design sen- 
sitivity and two one-year-long runs of the five-detector network, LIGO-Hanford+ 
LIGO-Livingston+Virgo+KAGRA+LIGO-India (HLVJJ), at design sensitivity”. 
We take the combined duty cycle to be 0.5 for the HLV detector configuration and 
0.3 for HLVJI. The number of detections is subject to Poisson statistics, and we 
simulate detections according to the merger rate, network sensitivity, observing 
time and duty cycle. 

To determine whether a binary merger is detected, we calculate the matched-filter 
signal-to-noise ratio (SNR) for each simulated binary. We draw the ‘measured’ SNR 
from a Gaussian distribution centred at the matched-filter value with a standard 
deviation of c= 1. Binary mergers are detected only if their measured network SNR 
is greater than 12. For each detected merger, we calculate its three-dimensional 
localization according to the methods in ref. ** (We have verified that this proce- 
dure yields results which are consistent with the full parameter estimation pipeline, 
LALInference*’.) The three-dimensional localization takes the form of a posterior 
probability distribution function, p(a, 6, D;|dgw), over the sky position (a, 6) and 
luminosity distance, D;, given the GW data, daw. 

The GW signal from each detected binary merger provides a measurement 
of D;. To calculate Ho, we must also measure a redshift for each binary merger. 
Throughout, we take the redshift, z, to be the peculiar-velocity-corrected redshift; 
that is, the redshift that the source would have if it were in the Hubble flow. We con- 
sider two cases: the redshift information either comes from a direct EM counter- 
part, such as a short )-ray burst/afterglow and/or a kilonova (‘with-counterpart’), 
or a statistical analysis over a catalogue of potential host galaxies (‘statistical’). 

In the with-counterpart case, we assume that the EM counterpart is close 
enough to its host galaxy that the host can be unambiguously identified, and we 
can measure its sky position and redshift. This is a reasonable assumption based 
on the distribution of offsets between short y-ray bursts and their host galaxies, 
assuming that short -\-ray bursts trace a population similar to that of BNS mergers, 
and taking into account that detected BNS mergers will be at much lower redshifts 
than the short y-ray burst population. We assume that the sky position of each host 
galaxy is perfectly measured (that is, with negligible measurement error), meaning 
we can fix the source sky position to the location of the counterpart in the GW 
parameter estimation (rather than marginalizing over all sky positions). The GW 
distance posterior changes slowly over the sky and therefore is not sensitive to the 
precise location of the counterpart. However, since the GW sky localization areas 
can be very large, fixing the source position can lead to important improvements 
in the distance, and hence Ho, measurements. We also assume that the peculiar- 
velocity-corrected redshift, z, is measured with a 1¢ error of 200 km s~! c~!, where 
cis the speed of light in vacuum, which is a typical uncertainty for the peculiar 
velocity correction?!?. 

In the absence of an EM counterpart we cannot identify a single host galaxy, 
and must use a catalogue of all potential host galaxies’. To simulate the galaxy 
catalogues we consider two cases: a uniform-in-comoving-volume distribution of 
galaxies, and a distribution that follows the large-scale structure as simulated by 
the MICE galaxy catalogue***”. In the uniform distribution case, we construct a 
mock catalogue by distributing galaxies uniformly in comoving volume with a num- 
ber density of 0.02 Mpc” 3. This corresponds to the number density of galaxies 
that are 25% as bright as the Milky Way, assuming the galaxy luminosity function is 
described by the Schechter function’* with B-band parameters 6, =1.6 x 10-7? Mpe3, 
ap=—1.07, Lp=1.2 x 10°73, and h=0.7 (where Ly is the solar luminosity 


in the B band), and integrating down to 0.16L; to find the luminosity density. 
(This corresponds to 83% of the total luminosity*?*".) The lower luminosity limit 
of the MICE catalogue is similar. Thus, we assume that only galaxies brighter than 
0.16L5 can host binary mergers, although we note that the population of host 
galaxies is currently uncertain, and we can modify the assumed luminosity limit 
by including the effects of catalogue incompleteness. A lower luminosity 
limit would increase the galaxy density and weaken the Hp constraints in the 
statistical case. 
Hp uncertainty. Not all GW events contribute equally to the Hy measurements. 
In the counterpart case, the fractional error on the Hp measurement from a single 
source depends on the fractional distance uncertainty of the GW source and the 
fractional redshift uncertainty of its host galaxy. To first order, this is: 

“u) (2) 
eae ee (1) 

Vi D, 
where ‘1gal’ denotes the case of a uniquely identified host galaxy and vj is the 
peculiar-velocity-corrected “Hubble velocity’. Because the recessional velocity 
uncertainty, Op is typically around 150-250 km s“!, the fractional recessional 
velocity decreases with distance. Meanwhile, the fractional distance uncertainty 
scales roughly inversely with SNR, and therefore tends to increase with distance. 
There is thus a ‘sweet spot, at which the peculiar velocities and the distance uncer- 
tainties are comparable; for LIGO-Virgo’s second observing run; this was about 
30 Mpc, near the distance of GW170817. The distance of the sweet spot will 
increase as the networks become more sensitive; for detectors at design sensitivity 
the ideal BNS standard siren distance will be about 50 Mpc. At distances beyond 
this, the distance uncertainty will tend to dominate the peculiar velocity uncer- 
tainty; in this regime, the nearest (highest SNR) events tend to provide the tightest 
Hp constraints. This can be seen in Extended Data Fig. 1, which shows the 
fractional Hp uncertainty for individual events, plotted against the median poste- 
rior distance and 90% posterior localization volumes. However, we note that the 
relationship between median distance, localization volumes, and fractional Ho 
uncertainty is not very tight. Prior to identifying the counterpart for a particular 
event, we can estimate the accuracy of the Hy measurement from the width and 
central value (for example, median) of the GW distance posterior according to 
equation (1), using an estimated vy 70(D,) km s-! Mpc”! where (D;) is the 
median GW distance. (Here we must use the GW posterior marginalized over the 
sky position, as we do not yet know the sky position of the counterpart.) We verify 
that this estimate of the combined distance and redshift uncertainty is a reasonable 
proxy for the resulting Hp uncertainty, assuming that an EM counterpart is found 
and provides an independent measurement of redshift. 

In the absence of a counterpart, we cannot assign a unique host, and so the 
Hp error increases with the number of potential host galaxies in the localization 
volume. Galaxy clustering can mitigate this, as we discuss in the main text. For 
example, in the case of GW170817, the optical counterpart was found in NGC 4993, 
which is a member of a group of about 20 galaxies, all of which have an equiva- 
lent Hubble recessional velocity*”. On the other hand, catalogue incompleteness 
degrades the Hp measurement, as we have to consider an additional background 
of uniformly distributed galaxies (see equation (5)). 


Bayesian model. For a single event with GW and EM data, dgw and dem, we can 
write the likelihood as: 


lgal 


J p(dow> dep Dp 0; 6, 2|Hp) dD, dadédz 


(2) 
BCAp) 


P(dgw> dpm lH) = 


where we have included a normalization term in the denominator, (3(Ho), 
to account for selection effects and ensure that the likelihood integrates to unity. 
We can factor the numerator in equation (2) as: 


J Pldows dpe Drs 0% 8,2|H)AD,dadédz 
7 J P(dgwIDp a 6)p(demlz; a, 6)p(Dz|z, H)p, (z; 0, 6|H) dD, dadédz 
= J P(dgwiD,, o, 6)p(demlz, &, 6)5(D;—D;,(z, Hp))p, (z, a, d|Hp) dD, dadédz 
= f Pldw1B, (es Hy) 0s 8)PCd gues 8)p, (2 a1 d|H))dadddz (3) 
where D, (z, Hy) denotes the luminosity distance of a source at redshift z, given a 


Hubble constant of Hp and leaving all other cosmological parameters fixed to the 
Planck values“® (2 ‘My = 9.308, 24, = 0.692). We can, alternatively, marginalize 
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over these other cosmological parameters, but since most detected binaries will be 
at low redshifts, the effects of other cosmological parameters on the z-D, relation 
are small. The term p(dgw|D1,a,6) is the marginalized likelihood of the GW data 
given a compact binary source at distance D; and sky position (a, 4), marginalized 
over all other parameters. Throughout, we assume that we can construct a cata- 
logue of the potential host galaxies for each event, and take the prior po(z,a,6|Ho) 
to be a sum of Gaussian distributions centred at the measured redshifts and sky 
positions of the galaxies: 


2 @, oO, 6|Hp) =~ Patalogue @, Os 6) 
a _ — (4) 
=— UNE o!1@NIa', o (a) NS, of] (5) 


gal 7? 


In practice, we ignore the uncertainties on the measured sky coordinates and treat 
the Gaussian distributions as é-functions centred at the measured a’, 5’ We take 
Z' to be the peculiar-velocity-corrected redshifts, and assume a standard deviation 
of co} =200 km s~ for each. In the above, we assign equal weights to each galaxy 
in the catalogue, but if we believe that certain galaxies are a priori more likely to 
be GW hosts, we can assign weights accordingly. For example, we can weight the 
galaxies in equation (4) by their stellar or star-forming luminosity, or by some 
assumed redshift-dependent coalescence rate of the GW sources. A critical 
assumption is that the sum in equation (4) contains the correct host galaxy. If we 
believe the catalogue is incomplete, we must replace our prior, po(z), with a 
weighted sum containing both the observed galaxies, equation (4)—weighted by 
the overall completeness fraction of the catalogue—and a smooth, uniform- 
in-comoving-volume distribution accounting for the unobserved galaxies: 


Py (Z 0 6|Ho) = PPrtatogue (2% 8) + IP) Pigs (> Os 6p) (5) 


catalogue 
where P catalogue is given by equation (4), and: 

av, 
dzdadé 


where Pcomplete(2,0,6) is the probability of a galaxy at (z,a,6) being in the catalogue. 
Meanwhile the completeness fraction fis given by: 


aise (z, a, 6|Hp) x [1—P, (z, a, 6)] (6) 


complete 


2max 


1 . a, 
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where Zmax is the maximum galaxy redshift considered in the analysis of an indi- 
vidual event, and V-(Zmax) is the total comoving volume enclosed within Zmax. 

In the case where we have an EM counterpart, the likelihood p(dgm|z,a,6) picks 
out one of the galaxies in the catalogue, so that the sum in the prior reduces to a 
single term corresponding to the EM-identified host galaxy. In the case where 
there is no EM counterpart, the EM data are uninformative, and we set the like- 
lihood p(dgm|z,a,6) cx constant. In the case where we have an EM counterpart but 
cannot pick out a unique host galaxy, one could consider a ‘pencil beanY containing 
all the potential host galaxies within around 100 kpc in projected distance on the 
sky. We assume the sky position of the counterpart is perfectly measured to be 
(a, 5), and take the term p(dpyi|z,,6) to be a top hat that picks out all of the gal- 
axies within some angular radius r (corresponding to around 100 kpc in 
projected distance) of the counterpart’s sky position. Thus, the numerator of 
equation (3) reduces to: 


P (dew ID (2, Ho), & 5), (2,0 d)dzdads 
((a,6)|(@6)) <r 


(8) 


and we sum over all galaxies within about 100 kpc in projected distance, but no 
longer weight them by the likelihood of the GW source at the corresponding sky 
position. Alternatively, we can incorporate assumptions about the kick distribution 
in the form of p(dgy|z,a,6) and place more weight at galaxies close to (a, 6) rather 
than assuming a simple top hat. Although for simplicity we do not apply the pencil- 
beam approach in this Letter, it can be thought of as a natural interpolation between 
the counterpart and statistical cases. 

To calculate the normalization term, ((Hp), in the denominator of equation (2), 
we must account for selection effects in our measurement process. In general, 
the GW and EM data are both subject to selection effects in that we only 
detect GW and EM sources that are above some threshold, d&y and djiy, 
respectively. Accounting for these detection thresholds, the denominator of 
equation (2) is: 


4 
(Hy) = J J J Pldgyr dg Dyn 0s 6,2|Hy)dD,dzdaddddewade, (9) 
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We define: 
cw, Sy 
Piet (Dira.2)= ff pldewlDir6,2)ddew (10) 
dew>déw 
and similarly: 
EM _ 
Piet (20,8)= fp days [2 0r8) ddan ij 


dem>dthy 


With these definitions, equation (9) becomes: 


B(Hy) = J PSN (Dy (z, Ho), 0, 5, Z)Pin (2, 0 6)p, (z, a, d)dadédz 


Note that we have applied the same chain-rule factorization to the inner four 
integrals as in equation (3). It is clear that the normalization factor (Ho) depends 
on Hp, so it is crucial to include it in the likelihood. For the EM selection effects, 
we assume that we can detect all EM counterparts and host galaxies up to some 
maximum true redshift, Zax. This is an over-simplification of the true EM selection 
effects, but is a reasonable assumption for the real-time galaxy catalogues that will 
be constructed during the EM follow-up to GW events. For example, at Advanced 
LIGO design sensitivity, 90% of 30M>-30M. BBH detections will be within 5 Gpc 
(and BNS detections will be within 0.3 Gpc)* (see http://gwe.rcc.uchicago.edu/). 
Furthermore, only the BBHs with the smallest localization volumes contribute to 
the Hp constraints, and these will typically be within 400 Mpc. A galaxy with the 
same absolute magnitude as the Milky Way would have an apparent magnitude 
of <23 at typical 30Mo-30M. BBH distances, or <17.5 for well localized BBHs, 
and <17 at typical BNS distances. Meanwhile, we expect kilonova counterparts to 
BNS mergers to have magnitudes of <21.7 on the first night, even at the farthest 
distances detectable by the HLVJI network at design sensitivity (assuming the 
optical counterpart to GW170817 is typical'**). This is well within the magnitude 
limits of upcoming survey telescopes (for example, the Large Synoptic Survey 
Telescope), as well as within reach of current instruments (such as the Dark Energy 
Camera, the Subaru Hyper Suprime-Cam, the Zwicky Transient Facility and so on). 

We assume that EM counterparts are detectable for binaries regardless of the 
binary inclination. Although the short -ray burst emission is expected to be 
beamed, the associated kilonovae are expected to emit isotropically. Furthermore, 
as GW170817 demonstrated, it is possible to identify a kilonova counterpart 
independently of the y-ray burst. We note that since face-on/face-off binaries 
are louder (have higher SNR) than edge-on binaries in GWs, the population of 
detected binaries will show a preference for face-on/face-off; our analysis repro- 
duces the expected inclination distribution among detected sources (see figure 4 
of ref. *°). Under these assumptions for the EM selection effects, we have: 


Pilz, a, 6) Xx H(Zmax — 2) (12) 


where H is the Heaviside step function, and equation (9) reduces to: 


2max 


B(H,) = J J PEW (Dy (2, Ho)s 0 5, Z)P, (Zs 0 6)dadédz 
0 


Meanwhile, we assume that the galaxy distribution is approximately isotropic on 
large scales, so that the galaxy catalogue prior can be factored as: 


Py (Zs 04 6) © py (Z)p, (a 6) (13) 


and we approximate po(a,,6) by a continuous, isotropic distribution on the sky. We 
note that this assumption would only introduce systematic errors if the galaxy dis- 
tribution had strong correlations with the sky sensitivities of the detectors, which is 
not to be expected. Although GWs experience redshifting, the effect of redshift on 
the detectability of the GW data is small. We can therefore define: 


PSN (Dy) = J Piet (Dr 048), (0% 8 dads (14) 


We assume a detection threshold for GW sources corresponding to a matched- 
filter network SNR pn = 12, so that the detection probability, PEW (D)), is the prob- 
ability that a source at distance D, will have SNR p> 12. Assuming a distribution 
of orientations, masses and spins among a population of sources, in addition to the 
assumed isotropic distribution on the sky, po(a,6), we calculate the fraction of 
sources that are detectable at a given distance, PEN (D,). We assume an isotropic 
distribution of orientation angles, and fix spins to be zero. For simplicity, we assume 
a monochromatic mass distribution for each type of source. (For example, we take 
all BNS sources to be 1.4Mo-1.4Mo.) We therefore have: 


2max 


BCH) = fo PSA" (Diz Hy))p, (2)d2 (15) 
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We note that for GW sources in the local Universe (D; < 50 Mpc): 


~ CZ 
D, (z, Hy) & — 
1, (Z, Ho) Fy 


(16) 


If we assume that the distribution of galaxies is uniform in comoving volume, then 
in the local universe, we can approximate: 


2 


Py (2) xz (17) 


With these approximations, assuming that EM selection effects are negligible 
(Zmax— 00), (Ho) is independent of the masses of the source, which determine 
the distance to which it can be detected. In fact, under these assumptions, (3(Ho) 
simplifies to: 


B(Ho) « He (18) 


However, in general, we must account for cosmological deviations from 
equations (16) and (17), so we calculate G(Ho) according to equation (15) throughout 
our analysis. We note that 3(Hp) is still only weakly dependent on the GW horizon 
and therefore on the (unknown) underlying mass distribution of GW sources. 
Nevertheless, the statistical framework described here can accommodate more 
complicated models of the GW source distribution and its effects on the detection 
probability (equation (14)). 


Data availability 

Source Data for Figs. 1, 2 and Extended Data Fig. 1 are provided with the online 
version of the paper. Other data that support the findings of this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Fig. 1 | Hy uncertainty for BNS systems with identified is due to the ‘sweet spot’ between peculiar velocities and distance 
counterparts and redshifts. Each point is the Hp uncertainty CH, froma uncertainties, as discussed in the text. We find that, in general, closer 
simulated detection with the Advanced HLV network operating at design events have smaller localization volumes and lead to better constraints on 
sensitivity, as a function of the 90% localization volume. The colours Hp, although the closest events yield slightly worse constraints because of 
correspond to the median of the GW distance measurement. The lower their larger fractional peculiar velocity uncertainties. 


limit to the precision of individual measurements of about 3 km s~' Mpc”! 
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Device-independent quantum random-number 


generation 


Yang Liu!?, Qi Zhao*, Ming-Han Li!, Jian-Yu Guan!?, Yanbao Zhang”, Bing Bai)’, Weijun Zhang*, Wen-Zhao Liu)?, 
Cheng Wu!?, Xiao Yuan!%, Hao Li’, W. J. Munro®, Zhen Wang’, Lixing You’, Jun Zhang!*, Xiongfeng Ma>*, Jingyun Fan!?*, 


Qiang Zhang!** & Jian-Wei Pan!?* 


Randomness is important for many information processing 
applications, including numerical modelling and cryptography'”. 
Device-independent quantum random-number generation 
(DIQRNG)** based on the loophole-free violation of a Bell 
inequality produces genuine, unpredictable randomness without 
requiring any assumptions about the inner workings of the devices, 
and is therefore an ultimate goal in the field of quantum information 
science*~’. Previously reported experimental demonstrations of 
DIQRNG®” were not provably secure against the most general 
adversaries or did not close the ‘locality’ loophole of the Bell test. 
Here we present DIQRNG that is secure against quantum and 
classical adversaries!*"!?. We use state-of-the-art quantum optical 
technology to create, modulate and detect entangled photon pairs, 
achieving an efficiency of more than 78 per cent from creation to 
detection at a distance of about 200 metres that greatly exceeds 
the threshold for closing the ‘detection’ loophole of the Bell test. 
By independently and randomly choosing the base settings for 
measuring the entangled photon pairs and by ensuring space-like 
separation between the measurement events, we also satisfy the 
no-signalling condition and close the ‘locality’ loophole of the Bell 
test, thus enabling the realization of the loophole-free violation of a 
Bell inequality. This, along with a high-voltage, high-repetition-rate 
Pockels cell modulation set-up, allows us to accumulate sufficient 
data in the experimental time to extract genuine quantum 
randomness that is secure against the most general adversaries. 
By applying a large (137.90 gigabits x 62.469 megabits) Toeplitz- 
matrix hashing technique, we obtain 6.2469 x 10” quantum- 
certified random bits in 96 hours with a total failure probability 
(of producing a random number that is not guaranteed to be 
perfectly secure) of less than 10~°. Our demonstration is a crucial 
step towards transforming DIQRNG from a concept to a key aspect 
of practical applications that require high levels of security and 
thus genuine randomness’. Our work may also help to improve our 
understanding of the origin of randomness from a fundamental 
perspective. 

The security, or unpredictability, of randomness generated by a 
device-independent quantum random-number generator can be 
assessed via the observation of the loophole-free violation of a Bell 
inequality. A Bell test involves two entangled particles, with each party 
choosing the measurement settings according to a random input and 
outputting a classical bit. To create a device-independent quantum 
random-number generator based on the violation of a Bell inequality, 
two sets of conditions must be fulfilled rigorously and simultaneously. 

First, it is necessary in the experimental implementations of 
DIQRNG to detect entangled particles with high efficiency in order to 
close the detection loophole and to ensure the no-signalling condition— 
which requires that there is no information exchange between the 


preparation of input randomness and the preparation of entangled 
particles, or between the measurement setting at one detector and 
measurement outcome at the other—by space-like separating relevant 
events. Alternatively, proper shielding could be applied to prohibit 
communications between relevant events”*; however, in practice, it 
is impossible to shield all of the known and unknown types of com- 
munication. Although recent progress in loophole-free tests of Bell 
inequalities'*-'® provides a way of realizing DIQRNG based on the 
violation of Bell inequalities, the implementation demands unprece- 
dented detection efficiency and system stability. Therefore, DIQRNG 
remains a formidable challenge. 

Second, an independent and identical distribution (i.id.) must 
not be assumed for the behaviour of the adversary, the most general 
quantum adversary should be considered in the security analysis and 
the production of random bits must occur at a non-vanishing rate 
and be noise-tolerant. In the i.i.d. scenario, assuming that the adver- 
sarial strategy follows a predetermined probability distribution, the 
security analysis is greatly simplified, even without considering any 
internal memory or time-dependent behaviour; however, the i.id. 
assumption in these theories fails in practice because, for example, 
the adversary may attack the system using the previous results in an 
adaptive, non-i.i.d. way. Although security against the most general 
quantum adversaries and without the iid. assumption has been rigor- 
ously proved’*!’-?!, a method for security analysis that is efficient for 
a non-infinite amount of data (and can therefore be tested experimen- 
tally) was demonstrated only very recently. A method for DIQRNG that 
does not use the i.i.d. assumption and that considers a general quantum 
adversary was proposed recently!”, based on the entropy accumula- 
tion theorem?!. With this method, the rate of randomness generation 
approaches the value for the i.i.d. case in the limit of a large amount 
of data. We have previously presented an experimental demonstration 
of DIQRNG” that closed the detection loophole but did not consider 
space-like-separated events. Here we report fully functional DIQRNG, 
evidenced by rigorously satisfying the two sets of conditions discussed 
above in our experimental set-up and by accounting for general quan- 
tum adversaries in the security analysis. The device-independent quan- 
tum random-number generator that we demonstrate outputs genuinely, 
quantum-certified random bits at a rate of 181 bits s~!—an important 
step towards practical applications. 

Our realization of DIQRNG is based on a sequence of Bell-test exper- 
iments in the format of the Clauser-Horne-Shimony-Holt (CHSH) 
game’, We assume neither modelling of the physical apparatus nor 
any relation between different experimental trials. Time-dependent 
or memory-like effects may be present across experimental trials. We 
adopt the spot-checking protocol'®!*4 for our experimental imple- 
mentation. In experimental trial i, Alice and Bob, who are spatially 
separated, each receives a photon from an entangled pair. A classical 
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Fig. 1 | Schematics of the experiment. a, Top view of the experimental 
layout. Alice's and Bob’s measurement stations are on the opposite 

sides of the source of entangled photon pairs, at distances of 93 + 1m 
and 90 + 1 m, respectively, where the errors quoted are system errors. 
(We measure the distance by holding whiteboards and measuring the 
distance with a laser ranger; the error measured this way is estimated 

to be less than 1 m.) b, Middle, creation of pairs of entangled photons. 
Light pulses of 10-ns duration and 200-kHz frequency from a 1,560-nm 
laser diode (LD) are amplified by an erbium-doped fibre amplifier 
(EDFA) and frequency-doubled in an in-line periodically poled lithium 
niobate crystal (PPLN); the red arrow indicates the direction of the light. 
The resultant 780-nm light pulses are focused into a periodically poled 
potassium titanyl phosphate crystal (PPKTP) in a Sagnac loop to generate 


bit ¢; = 0 or t; = 1 is generated with probability 1 — q or q, respectively. 
If t; = 0, then the trial is a ‘generatior trial, with fixed inputs for Alice 
and Bob, x; = 0 and y; = 0, respectively. If t; = 1, then the trial is a ‘test 
trial to test against adversaries. In our experiment, we set all trials to 
be test trials by choosing q = 0. In each test trial, Alice and Bob each 
receives a bit from a quantum random-number generator, x;, y; € {0, 1}, 
as an input that determines their measurement setting. Alice’s (Bob’s) 
measurement setting is not affected by Bob’s (Alice’s) measurement set- 
ting or measurement outcome, and is independent of the entanglement 
creation at the source. Hence, the experiment satisfies the no-signalling 
condition. We assume that the two random inputs, x; and y,, are created 
independently of the rest of the experiment, and that their creation is 
iid. for all of the n trials. The corresponding measurement outcomes 
are aj, b; € {0, 1}. We assign a CHSH game value of J; = 1 if a; 6 bj = xy; 
and of J; = 0 otherwise. 

Considering uniform inputs for all n experimental trials (so that the 
probability of selecting any x;, y; € {0, 1} is 1/4), the CHSH game value 
J over all n experimental trials is 


J=+y7J-3/4 
M j=1 


Lens DM QWP HWP Filter Coupler Polarization ppxtp Pockels cell PBS WDM 
controller 


Il 


PPLN 


polarization-entangled photon pairs. A half-wave plate (HWP) and two 
quarter-wave plates (QWPs) are used to control the relative amplitude 
and phase in the polarization-entangled two-photon state that is created. 
Left and right, measuring the polarization of single photons. The single 
photons exit the fibre, experience the polarization-state measurement in 
free space and are collected into single-mode optical fibres to be detected 
by superconducting nanowire single-photon detectors (SNSPDs). The 
apparatus that is used to perform the measurement consists of a Pockels 
cell, a quarter-wave plate, a half-wave plate and a polarizing beam splitter 
(PBS). Quantum random-number generators (QRNGs) output random 
bits, triggering the Pockels cell to switch between two polarization 
orientations. DM, dichroic mirror; WDM, wavelength-division 
multiplexer. Underlying map in a from Google, DigitalGlobe. 


The experiment is subject to various possible loss mechanisms. We 
require that the photon loss is low enough to close the detection loop- 
hole. Any adversarial strategies based on local hidden-variable models 
yield J <0. Therefore, J > 0 indicates that the measurement outcomes 
cannot be pre-determined and therefore represent genuine, unpredict- 
able quantum randomness. 

The amount of unpredictable randomness that can be extracted in 
the presence of the quantum adversary system E is quantified by the 
smooth min-entropy””: 

Hes 


min(AB|XYE) 2 AR oy lEs EBA> Wexp) 


with smoothing parameter ¢,, expected CHSH game value wep and 
failure probability for the entropy accumulation protocol eg,q. X and Y 
denote the input sequences for Alice and Bob, respectively, and A 
and B the corresponding output sequences. The lower bound of the 
generation rate, Ropi(Es, EEA, Wexp)) is used as the theoretical amount of 
randomness on average for each trial. See Supplementary Information 
section I.B for details. By using a Toeplitz-matrix hashing extractor 
with a size of n x [H,s,,(AB|XYE) — t,], where ¢, is the number of bits 
sacrificed to minimize the information that an adversary may acquire 
(a parameter that is relevant to the failure probability of the extractor), 
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we extract H;s,, (AB|XYE) — t, random bits with genuine unpredicta- 
bility from the raw data obtained in the n experimental trials. The 
random bits are €, + e:4 + 2~* close to a uniform distribution, where 
2-* is the failure probability in extraction, with tf. = 100. 

Although we do not assume the inner workings of the devices, we 
still require a few assumptions in our experimental implementation: 
(1) that the devices and adversaries behave according to the laws 
of either classical or quantum mechanics; (2) that Alice's and Bob’s 
devices are located in one secure laboratory so that the adversaries 
cannot access their measurement outcomes; (3) that Alice and Bob 
each receives a sequence of uniform random bits to determine their 
measurement setting from an independent, trusted source; and (4) that 
Alice and Bob each has a trusted classical post-processing unit to 
extract the final random bits. In general, device-independent protocols 
(in particular for quantum key distribution) assume that Alice’s and 
Bob’ devices are located in two secure laboratories and in between them 
there is a classical authentication channel. In these cases, the publicly 
announced information may be leaked to the adversaries. The secu- 
rity is then compromised by reusing the devices. Any attack based on 
this premise is known as a memory attack”. With the above assump- 
tions and by ensuring no information leakage, our implementation is 
secure against memory attacks. In our experiment, quantum random 
numbers are required as inputs to switch the measurement basis in 
the Bell inequality. In this sense, our experiment can be seen as a type 
of randomness expansion, which generates more randomness from a 
random seed. An interesting future direction of research would be to 
create nearly perfect random numbers from weak randomness, which 
may require more than one set of Bell-test equipment. 

Our experimental implementation is depicted in Fig. 1. We create 
entangled photon pairs at a wavelength of 1,560 nm using spontaneous 
parametric downconversion in a Sagnac interferometer (see Methods). 
We then distribute the two photons of a pair in opposite directions 
to Alice’s and Bob’s measurement stations, which are at distances of 
93 m and 90 m from the source, respectively. A detailed space-time 
analysis (Fig. 2) shows that the relevant events in the experiment are 
space-like-separated (Supplementary Information section I.E). We 
obtain an overall efficiency from the creation to the detection of the 
entangled photons of 78.8% + 1.9% for Alice and 78.5% + 1.5% for 
Bob” (where the errors quoted are one standard deviation), surpass- 
ing the threshold to close the detection loophole (Supplementary 
Information section II.C). 

To achieve the maximum violation of the Bell inequality’, we create 
a non-maximally polarization-entangled two-photon state, 
cos(22.05°)|HV) + sin(22.05°)| VH)and choose the measurement sett- 
ings to be —83.5° (for x; = 0) or —119.4° (for x; = 1) for Alice and 6.5° 
(for y; = 0) or —29.4° (for y; = 1) for Bob when measuring the polari- 
zation state of the entangled photons. The measurement settings are 
selected randomly by the quantum random-number generators in each 
experimental trial. The two quantum random-number generators are 
based on vacuum noise fluctuation (Supplementary Information 
section II.A). 

Our system is now robust against noise, which allows us to complete 
n= 6.895 x 10'° experimental trials in 95.77 experimental hours, oper- 
ating continuously. To quantify the significance of our experimental 
results, we perform a hypothesis test of local realism. The null hypoth- 
esis is that the experimental results are explainable by local realism 
under the assumption that the input distribution at each trial is uni- 
form. The evidence against local realism, under the above assumption, 
is quantified by a statistical P value computed using a test statistic. 
The P value is the maximum probability according to local realism 
that the statistic takes a value as extreme as the observed one. Hence, 
small P values imply strong evidence against local realism. We apply 
the prediction-based ratio (PBR) method of analysis** to design the 
test statistic and compute an upper bound for the P value. The PBR 
analysis provides valid upper bounds for P values without assuming 
the iid. condition. The upper bound that is returned after the whole 


experiment is pir = 107204792, indicating a strong rejection of local 
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Fig. 2 | Space-time diagram for the experimental design. T; = 10 ns 

is the time taken to generate a pair of entangled photons. Tgrngi,2 are 

the times required to generate random bits to switch the Pockels cells. 
Taelayi,2 are the lengths of time between the random bits being generated 
and delivered to the Pockels cells. Tpci,2 are the waiting times for the 
Pockels cells to be ready to perform state measurements after receiving 
the random bits. Ty, are the times taken by the superconducting 
nanowire single-photon detectors to output electronic signals. For 

TarRNGI = Tarne@2 = 96 ns, Tgelay1 = 270 ns, Taelay2 = 230 ns, Teci =112ns, 
Tpc2 = 100 ns, Ty = 50 ns and Ty = 100 ns, we place Alice’s and Bob’s 
measurement stations on opposite sides of the source at distances of 93 m 
and 90 m, respectively. The effective optical length between Alice's (Bob’s) 
station and the source is 132 m (119 m). This arrangement ensures no 
signalling between relevant events in the experiment. The shaded areas are 
the future light cones for the source, Alice and Bob. 


hidden-variable models (see Supplementary Information III.C). With 
the PBR method, we also test the null hypothesis that the experimental 
results satisfy the no-signalling condition under the assumption that 
the input distribution at each trial is uniform. In this case, we obtain an 
upper bound of png = 1, indicating no evidence of anomalous signal- 
ling in the experiment (see Supplementary Information section IL.B). 

We compute the CHSH game value J over n experimental trials to be 
J =2.757 x 10-*. By setting the expected CHSH game value to that 
measured in the experiment, Wexp = 2.757 x 10-4, and assuming that 
€, =z, = J1/n =3.8 x 10 °and that the width of the statistical con- 
fidence interval for the estimate of the Bell violation is 
Soc: = $10/n = 1.2042 x 10°, we find a total failure probability of 
€,+&, +2" <1 x 10~>. After developing a computing technique for 
fast Toeplitz-matrix multiplication (Supplementary Information sec- 
tion II.H) that allows us to apply an 137.90 Gb x 62.469 Mb Toeplitz- 
matrix hashing, we obtain 6.2469 x 10’ genuinely, quantum-certified 
random bits, corresponding to a rate of 181.2 bits s~ ! with a total failure 
probability of 10~°. The stream of random bits passes the National 
Institute of Standards and Technology (NIST) statistic test suite 
(Supplementary Information section III.A). As shown in Fig. 3, the 
amount of randomness generated with our experimental set-up is 
56.9% of the optimal value in the i.i.d. case for n = 6.895 x 101°, and 
asymptotically approaches the optimal value with an increasing number 
of experimental trials. In the inset of Fig. 3 we plot the randomness 
production as a function of time, demonstrating the robustness of the 
system. 

In conclusion, we report here the realization of DIQRNG that is 
secure against the most general quantum adversaries and outputs 
181 quantum-certified random bits per second. A next step will be 
to improve the violation of the Bell inequality and the stability of the 
system to achieve higher production rates of quantum-certified random 
bits for practical applications that require high levels of security. We 
anticipate that our work will be helpful in topics such as randomness 
amplification”, the minimum assumption necessary for randomness 
generation, and fundamental problems relating to the understanding 


of non-locality, entanglement and randomness’. 
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Fig. 3 | Randomness generation versus number of trials. The solid blue 
curve and dashed red curve show the theoretical rate of randomness 
generation H,5,, versus the number of trials n for a finite amount of data 
(up to 10!*) and the i.id. case, respectively. The green circle denotes our 
experimental result. The inset shows the number of random numbers 
generated versus time in our experiment. We set Wexp = 2.757 X 10-4, 

€, = &gq = 1/n and 6,4. = ./10/n for the finite data rate. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Entanglement creation, distribution, and detection at low loss. The experimen- 
tal layout is depicted in Fig. 1. A 1,560-nm laser outputs 10-ns laser pulses periodi- 
cally at a repetition rate of 200 kHz. The pulses are amplified by an erbium-doped 
fibre amplifier and frequency-doubled in an in-line periodically poled lithium 
niobate crystal. After removing the residual 1,560-nm light with a wavelength- 
division multiplexer and spectral filters, the 780-nm light pulses that are generated 
are focused into a periodically poled potassium titanyl phosphate (PPKTP) crystal 
with a poling period of 46.5 1m in a Sagnac loop to generate polarization-entangled 
photon pairs via a type-II spontaneous parametric downconversion process. 

We optimize the efficiency to couple the entangled photons that are created 
into a single-mode optical fibre by setting their beam waists to the theoretical 
optimized size with respect to that of the pump beam”®"03!, For the pump light 
exiting a 780HP single-mode fibre, which has a mode field diameter of 5 1m, 
we focus it with a focal length of f= 8 mm aspherical lens to the centre of the 
crystal at a distance of 70 cm, with a measured beam waist of 180 um and a beam 
quality factor of M? = 1.05. The entangled photons are collected into a SMF28e 
single-mode optical fibre, which has a mode field diameter of 10.4 sm. With a 
f= 11mm aspherical lens and a f= 175 mm spherical lens, we set the beam waist 
to be 85 jm at the centre of the PPKTP crystal. The spherical lens is at a distance 
of 19 cm from the aspherical lens and 45 cm from the PPKTP crystal. 

A half-wave plate and two quarter-wave plates in the beam path of the pump 
light are used to control the relative amplitude and phase in the polarization- 
entangled two-photon state. The residual 780-nm pump light is removed by 
dichroic mirrors. The entanglement source is placed on a 1 m x 1 m breadboard, 
with the ambient temperature stabilized to be within +1 °C to improve the stability 
of the system. 

The entangled photons are sent in opposite directions to two remote meas- 
urement stations that are 93 + 1 m and 90 + 1 m away, ensuring space-like sep- 
aration between the event of entanglement creation in the source and the event 


of choosing the measurement settings at the measurement stations, and between 
the event of choosing the measurement setting at one station and the events of 
choosing the measurement setting and outputting the outcomes at the other station 
(Supplementary Information section II.E). 

At the measurement station, the single photons exit the fibre, pass the Pockels 
cell, a quarter-wave plate and a half-wave plate, and a polarizing beam splitter, 
and are coupled into the single-mode optical fibre to be detected by the supercon- 
ducting nanowire single-photon detectors*”. The Pockels cell is switched to set the 
base for the measurement of the single-photon polarization upon receiving a bit 
from a quantum random-number generator. A time-digital converter is used to 
time-tag the events for random-number generation, single-photon detection and 
synchronization signal. 

We measure the overall efficiency from the creation to the detection of single 
photons to be 78.8% + 1.9% for Alice and 78.5% + 1.5% for Bob, surpassing the 
threshold to close the detection loophole. The loss is mainly due to the limited 
efficiency: 94% in collecting the photon pairs that are created into single-mode 
optical fibre, and 92% for the superconducting nanowire single-photon detectors 
(Supplementary Information section II.C). 


Data availability 

The data that support the findings of this study are available from the correspond- 
ing authors on reasonable request. Source Data for Fig. 3 is provided with the 
online version of the paper. 
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Exciton-polariton topological insulator 
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Topological insulators—materials that are insulating in the bulk 
but allow electrons to flow on their surface—are striking examples 
of materials in which topological invariants are manifested in 
robustness against perturbations such as defects and disorder!. 
Their most prominent feature is the emergence of edge states at the 
boundary between areas with different topological properties. The 
observable physical effect is unidirectional robust transport of these 
edge states. Topological insulators were originally observed in the 
integer quantum Hall effect” (in which conductance is quantized 
in a strong magnetic field) and subsequently suggested*-> and 
observed® to exist without a magnetic field, by virtue of other 
effects such as strong spin-orbit interaction. These were systems 
of correlated electrons. During the past decade, the concepts of 
topological physics have been introduced into other fields, including 
microwaves”*, photonic systems”’®, cold atoms!!!” acoustics!**4 
and even mechanics!°. Recently, topological insulators were 
suggested to be possible in exciton-polariton systems!* !8 organized 
as honeycomb (graphene-like) lattices, under the influence of a 
magnetic field. Exciton-polaritons are part-light, part-matter 
quasiparticles that emerge from strong coupling of quantum- 
well excitons and cavity photons!’. Accordingly, the predicted 
topological effects differ from all those demonstrated thus far. Here 
we demonstrate experimentally an exciton-polariton topological 
insulator. Our lattice of coupled semiconductor microcavities is 
excited non-resonantly by a laser, and an applied magnetic field 
leads to the unidirectional flow ofa polariton wavepacket around the 
edge of the array. This chiral edge mode is populated by a polariton 
condensation mechanism. We use scanning imaging techniques in 
real space and Fourier space to measure photoluminescence and 
thus visualize the mode as it propagates. We demonstrate that the 
topological edge mode goes around defects, and that its propagation 
direction can be reversed by inverting the applied magnetic field. 
Our exciton-polariton topological insulator paves the way for 
topological phenomena that involve light-matter interaction, 
amplification and the interaction of exciton-polaritons as a 
nonlinear many-body system. 

Microcavity exciton-polaritons (sometimes simply called polaritons 
hereafter) are composite bosons originating from the strong coupling 
of quantum-well excitons to microcavity photons. While the excitonic 
fraction provides strong nonlinearity, the photonic part results in a 
low effective mass, allowing the formation of a Bose-Einstein conden- 
sate that is dissipative in nature and can be driven by a laser beam””. 
Polaritons have thus been described as “quantum fluids of light”?!. For 
the epitaxially well-controlled III-V semiconductor system, several 
techniques are available to micropattern cavities and thus precisely 
engineer the potential landscapes of polaritons”’. With recent advances 
bringing topological effects into the realms of photonics and polari- 
tonics’-!°”3, several ways to realize topological edge propagation with 
polaritons have been suggested'*'8, with honeycomb geometries (‘arti- 
ficial graphene’™*) being of particular interest to realize a Chern band 
insulator with Chern number C=2 (refs !718). Polariton honeycomb 


lattices have been found to support Dirac-cone dispersions” as well as 
edge modes” inherited from their graphene origin”””*. Here, we take 
the next step and create a topological Chern insulator in a symbiotic 
part-light, part-matter system: the system of exciton-polaritons. Our 
experiments are based on the proposals!”!8 of a honeycomb potential 
landscape for exciton-polaritons, with its time-reversal symmetry bro- 
ken by an applied magnetic field. 

Let us first introduce the underlying physics and show its features in 
numerical simulations with a realistic set of sample parameters. The 
experiments display reliable condensation of polaritons in the vicinity 
of the Dirac cones and the creation of topological edge modes by apply- 
ing a magnetic field. Hence, we consider the injection of polaritons into 
the topological gap and focus on the inherent properties of the chiral 
edge modes. 

A general schematic of the experiment is presented in Fig. la. 
Figure 1b, c depicts the calculated dispersion relation of the honey- 
comb structure in the direction [ — K, connecting two Dirac cones 
(K and K’), without and with an applied magnetic field, respectively. 
The effective spin-orbit coupling of polaritons, induced by transverse 
electric/transverse magnetic (TE-TM) mode splitting, breaks the 
polarization-related symmetry, and thus each Dirac point transforms 
into four inverted parabolas'*. Whereas the spin-orbit interaction is 
extremely small in real graphene”’, ‘polariton graphene offers the pos- 
sibility of making the effective spin-orbit coupling large enough to 
open a sizable gap in a magnetic field. Without a magnetic field, the two 
central parabolas touch each other at the Dirac cones of the underlying 
honeycomb lattice (Fig. 1b). The degeneracy between the states in the 
crossing points can be lifted in the presence of a magnetic field anda 
finite Zeeman splitting. As a consequence, an energy gap forms near 
the Dirac cones (Fig. 1c). It is worth mentioning that the Dirac cones 
K and K’ are not equivalent. At the K (K’) point, the ‘valence’ band is 
formed from the B (A) pillars, and the ‘conduction’ band is formed from 
the A (B) pillars'®. The reversed order of the bands in the basis of the 
sublattices signifies that the gap is topologically non-trivial’. 

The interplay of an external magnetic field and the effective spin- 
orbit coupling (TE-TM mode splitting) results in non-zero Berry con- 
nections around the K and K’ points, contributing to the total band 
Chern number C= +2 (refs '”!8). As a consequence, the honeycomb 
structure supports one-way propagating edge states for the energies 
within the topological gap. Figure 1c demonstrates the results of a band 
structure calculation combined with the dispersion of the edge states 
localized at the zigzag edge of the honeycomb structure. The propa- 
gation direction of these edge states is related to the direction of the 
external magnetic field: the polariton edge current is either left-moving 
(L, yellow) or right-moving (R, red), depicted in Fig. la, depending 
on the sign of the magnetic field. Figure le depicts the corresponding 
calculated edge mode. 

To illustrate the existence and robustness of the topologically pro- 
tected one-way edge states, we simulate the evolution of a wavepacket 
excited locally (Fig. 1f, g; red circle) at the zigzag edge of the honey- 
comb structure. Figure 1f, g shows that the launched wavepacket starts 
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Fig. 1 | Experimental scheme, calculated band structure and simulated 
dynamics of the topological polariton edge modes. a, Schematic of the 
non-resonant laser excitation of left-moving (yellow) and right-moving 
(red) chiral topological polariton edge modes in a magnetic field H,. 
b, c, Trivial and topological band structures of the polariton honeycomb 
lattice for zero Zeeman splitting (b) and with Zeeman splitting of A.f= 0.2 meV, 
induced by the external magnetic field (c). One-way topological edge 


to propagate left or right along the edge, depending on the polarity 
of the magnetic field, whereas in the absence of magnetic field, the 
launched wavepacket remains at the excitation point. Note that the 
overall intensity decreases with propagation, as the model takes into 
account a realistic polariton lifetime of about 35 ps (see Methods for 
details). The simulations reveal that the chiral edge mode is topologi- 
cally protected, as it propagates along the 90° corner and is able to pass 
a point-like defect without scattering (see Methods). 

Having established the features that we expect to observe in an exci- 
ton-polariton topological insulator, we describe the experimental plat- 
form. To realize a honeycomb potential lattice, we fabricate a planar 
microcavity containing three Ino.94Gao.96As quantum wells in a \-cavity, 
sandwiched between two GaAs/AlGaAs distributed Bragg reflectors 
(DBRs) (Fig. 2a). Subsequently, we use electron beam lithography 
to define the honeycomb lattice, using micropillars with a diameter 
of d=2.0 pm and a pillar-to-pillar overlap of v=a/d=0.85, where a 
denotes the centre-to-centre distance between neighbouring pillars. 
Finally, the upper DBR is etched in such a way that only two DBR pairs 
of the top DBR remain, so as not to damage the active quantum-well 
region (Fig. 2b, c) (see Methods). When these sites are arranged in a 
two-dimensional lattice, the discrete pillar modes hybridize because of 
their proximity to one another and form a polaritonic band structure. 
The honeycomb lattice is characterized by a two-element base in real 
space (Fig. 2d) and two non-equivalent K and K’ points supporting 
Dirac cones in the first Brillouin zone (Fig. 2e), as well known from 
graphene. Figure 2f depicts characterization of the polariton honey- 
comb lattice in the linear regime, using non-resonant laser excita- 
tion. The Fourier-space energy-resolved photoluminescence of the 
investigated lattice is imaged in the k, direction and scanned in the 
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modes are represented by red (right-moving) and blue (left-moving) data 
points within the topological gap. d, e, Schematic (d) and calculated (e) 
intensity profiles of the edge modes. f, g, Calculated propagation dynamics 
of edge modes injected coherently (red circles) into the topological 

gap: left-moving time sequence (yellow) for negative Zeeman splitting 

Ap =—0.8 meV (f) and right-moving propagation (red) for positive 
Zeeman splitting Ag = +0.8 meV (g). arb. units, arbitrary units. 
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k, direction. The blue data points are fitted to the measured disper- 
sions, accurately revealing the six Dirac cones at the K and K’ points. 
The results of the corresponding tight-binding model (see Methods 
for details) are plotted in red and brown, agreeing well with the exper- 
imental data. 

Next, we describe the experiments conducted on the honeycomb 
exciton-polariton lattice under an external magnetic field, aiming to 
find topologically non-trivial edge states. To be able to observe a band- 
gap that opens at the Dirac points, the Zeeman splitting and the TE- 
TM splitting at the Dirac points need to be sufficiently large compared 
with the photoluminescence linewidth. This implies that the polaritons 
need to have a sufficient excitonic part. Therefore, we select a lattice at 
a moderate negative exciton-to-photon detuning of 6= —11.5 meV as 
summarized in the Methods. 

To assess the size of the bandgap, we apply polarization-resolved 
spectroscopy, making use of a \/4 polarization series at an external 
magnetic field of B=0 T and 5 T. At an external magnetic field of 
B=+5 T, the Hopfield coefficients at the Dirac points yield a pho- 
tonic fraction of |C|? =0.96 and an excitonic fraction of |X|? =0.04. 
Furthermore, the TE-TM splitting at the wavevector of the Dirac point 
(that is, |kp| 0.777/a) can be estimated to be 400 j1eV for the photons, 
resulting in an effective TE-TM splitting for the exciton-polaritons 
of about 384 eV (Ger 263 jxeV jum’; see Methods). The Zeeman 
splitting of the excitonic mode is determined to be about 540 eV, 
leading to an effective Zeeman splitting of Agg-22 eV at the Dirac 
point (see Methods). As the TE-TM splitting is considerably larger 
than the Zeeman splitting in the lattice studied here, the experimen- 
tally determined bandgap that opens because of the magnetic field is 
A, = 108 + 32 peV with Acr< A, < Ges, which, as we show later, is 
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Fig. 2 | Lattice device layout and geometry. a—c, Scanning electron 
microscope images of the processed polariton honeycomb lattice. 

a, Cleaved cross-section of the microcavity before processing. b, Tilted 
view of the half-etched honeycomb lattice for pillars with a diameter of 
d=2.0um and an overlap v=a/d=0.85. c, Cleaved cross-section after 
etching, showing that only the top DBR has been etched. d, Real-space 


reasonable for observing the topological features of our lattice and com- 
pares well with a gap size of about 100 eV found for realistic system 
parameters”. 

After having established that a bandgap opens at the Dirac points 
under the influence of an external magnetic field, we use non-resonant 
excitation, under conditions allowing for polariton condensation into a 
chiral edge state within this bandgap. Scans of the photoluminescence sig- 
nal in real space as well as Fourier space are performed on the polariton 
lattice at external magnetic fields of B= —5 T, 0 T and +5 T: The sample 
is excited on the zigzag edge by a pulsed and chopped laser beam with a 
diameter of 40 jum at 792 nm wavelength, tuned to the first stopband min- 
imum (see Fig. 1a). In Fig. 3a and b, the linear polariton dispersion in the 
Kk’ >I — Kdirection at B=0 T and at B= +5 T is displayed. The Dirac- 
cone dispersion is clearly visible (P=+0.1 mW). At a threshold power of 
Py, = 1.8 mW, we observe a nonlinear increase of the output power as well 
as a sudden decrease in linewidth (see Methods for details), establishing 
that a polariton condensate has formed at the K and K’ points at around 
k= £0.777/a, as displayed in Fig. 3c. We now perform mode tomography, 
scanning the real-space (x, y) landscape and measuring the photolumines- 
cence energy Ep. Figure 3d shows the spectrum ofa line perpendicular to 
the zigzag edge, taken at the position indicated by the solid white line in 
Fig. 3g. The dashed white line depicts the edge of the sample. Besides an 
S-band condensate throughout the excited structure at Es = 1.4674 eV, we 
observe an edge mode: a region of high intensity residing only at the out- 
ermost row of lattice sites spatially and spectrally at an energy of 1.4678 eV 
(white ellipse in Fig. 3d)—the expected topological bandgap. 

Figure 3e shows the intensity pattern integrated over the energetic 
range of the trivial S-band mode centred at 1.4674 eV. Clearly, the con- 
densate is relatively homogeneous over a large fraction of the lattice. 
The white overlaid lattice geometry and the microscopy image insets 
illustrate the position of the edge of the sample. Now, by changing the 
energy of the mode tomography to an energy Ezdge = 1.4678 eV, within 
the topological gap (under a magnetic field of +5 T), the existence of 
edge states becomes unequivocal (at x~ 8 jum in Fig. 3g). The photolu- 
minescence at this energy originates predominantly from the outermost 
row of lattice sites, with almost no emission detected from the bulk of 
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k, (n/a) 


ky (t/a) 


honeycomb unit cell with two-element basis. e, First Brillouin zone of 
the honeycomb lattice, featuring the two non-equivalent K and K’ points. 
f, Measured Fourier-space energy-resolved photoluminescence of the 
investigated lattice. The blue data points are fitted to the measured 
dispersions, agreeing well with a tight-binding model (red and brown), 
accurately revealing the six Dirac cones at the K and K’ points. 


the lattice. The mode is in excellent agreement with the Bloch-mode 
calculations in Fig. 1e-g. In addition, the theoretical description within 
a Ginzburg-Landau-based model confirms that polariton condensation 
into the edge mode occurs (see Methods for details). 

We now move on to study the robustness of the topological edge 
state. We do so by installing an artificial defect into the lattice at 
y= 15m (white circle in Fig. 3e-g). The defect is formed by leaving 
one of the sites on the zigzag edge of the honeycomb lattice unoccupied 
(see Methods). Normally, such a strong defect would cause scattering 
into the bulk, but here (see Fig. 3f, g) such scattering is suppressed, indi- 
cating that the transport of the topological edge state is immune to such 
defects. In addition, we perform mode tomography using non-resonant 
excitation with a large spot at the corner position of the sample. When 
plotting the energy of the topological edge mode Ezdge = 1.4678 eV in 
Fig. 3f, one clearly observes that the mode extends around the corner 
from the zigzag into the armchair configuration, without any sign of 
backscattering or bulk scattering. The measurements at B= —5 T show 
very similar behaviour, but the transport is in the opposite direction. 
When the magnetic field is absent (B =0 T), the edge mode vanishes 
completely (see Methods for details). The observation of the edge 
mode around the corner and especially its existence at the armchair 
edge, where topologically trivial honeycomb lattices do not have edge 
modes””’, prove that the edge mode we observe is indeed topological 
and is endowed with topological protection. 

For further insight into the nature of these edge states, we analyse 
hyperspectral images ((k,, ky) versus Ep,) to identify the dominant 
propagation direction, with and without magnetic field. The results 
are displayed in Fig. 4. While in real space the modes can be clearly 
separated in energy, the integration over Fourier space results in 
the topological edge mode and the trivial bulk modes overlying one 
another. Figure 4a, b depicts the integrated intensities of the full S-band 
condensate (1.467-1.468 eV), including the energies associated with 
the edge state, for experiments at B= +5 T and —5 T, respectively. To 
analyse the directionality of polariton transport, the maximum peak 
intensities at the two maxima at k, +0 and ky = +0.777/a are extracted 
by identifying the central pixel of the peak and averaging the intensity 
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Fig. 3 | Photoluminescence measurements of a polariton condensate in 
a topological edge mode. a, Polariton dispersion along the K’ ~ T > K 
direction at B=0 T compared with the calculated Bloch-mode model 
(black dots). b, c, Equivalent dispersions at B= +5 T, below (b) and 
above (c) the threshold Pj, = 1.8 mW, where condensation into the 

Kand K’ points of the S band is observed. Bloch-mode calculations (red 
dots) show a distinct gap. d, Real-space spectrum in the x direction along 
the straight white line in g. The x axis in d, e and g is the same. A trivial 
S-band condensate throughout the structure and a mode (E = 1.4678 eV) 


of a region of 3 x 3 pixels centred around this position. The vertical 
axis of Fig. 4c shows the ratio of the luminescence travelling in one 
direction (++k,) to that in the opposite direction (—k,). Deviation of this 
quantity from unity is an essential characteristic of a chiral state, and an 
opposite deviation should appear for opposite applied magnetic field. 
The corresponding intensity ratios are plotted in blue and show a clear 
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well separated from the bulk, located at the zigzag edge (dashed white 
line), are observed. e, Mode tomography displaying a homogeneous 
trivial S-band condensate (Es = 1.4673-1.4675 eV) within the pump spot 
diameter of 401m. The inset shows a microscopy image of the structure. 

f, g, Mode tomography of the topological edge mode (Ezage = 1.4678 eV) 
at the corner position of the sample (f) and at the same position as in e (g). 
The mode is well located at the zigzag edge and clearly extends around the 
corner to the armchair configuration. 


directionality when an external magnetic field is applied. The transport 
changes its predominant direction along the edge when the direction 
of the magnetic field is inverted. This observation supports the inter- 
pretation of the edge-states being a result of a topologically non-trivial 
bandgap, with the edge mode contributing to the chirality along the 
edge. Reversing the magnetic field reverses the slope of the dispersion 


Fig. 4 | Chirality and propagation of the 
condensate. a, b, Spectroscopic hyperspectral 
= measurement of the full S-band condensate 
(1.467-1.468 eV) at the K and K’ points, 
including the energies linked to the 
topological edge modes for B= +5 T (a) and 
B=—5T (b). The zigzag edge is oriented in 
the y direction. c, Polariton intensity ratio 
between the K’ and K points in the k, direction 
(k, 0) as a function of the applied magnetic 
field. The dominant propagation direction 
| is inverted (yellow and red arrows) when the 
direction of the magnetic field is reversed. For 
| B=OT, no dominant propagation direction is 
observed. The error bars originate from image 
distortions, inhomogeneities of the excitation 
and uncertainties during data processing, and 
5 are estimated at 5%. 


Magnetic field, B (T) 
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curve of the topological edge mode, which is physically manifested in 
reversing the group velocity. On the other hand, we find no systematic 
directionality for the peaks at k, —0.771/a in Fig. 4a, b, which implies 
that these arise solely from the bulk condensate. 

The experimental results depicted in Figs. 3, 4 unequivocally prove 
the observation of an exciton-polariton topological Chern insulator. 
The application of a magnetic field to the honeycomb lattice opens 
up a topological bandgap, with topological edge states supporting 
unidirectional transport whose propagation direction is determined 
by the field polarity. The lack of scattering from an artificial defect 
manifests the robustness of the topological edge mode. Furthermore, 
the observation of the edge mode extending around the corner and at the 
armchair termination without bulk scattering is a distinct feature of 
the topological edge mode, highlighting its topological protection 
against defects and disorder. Our results lead the way to efficient light 
trapping and topologically protected propagation of coherent exciton- 
polariton condensates in a well-developed semiconductor platform in 
which electrical driving can be envisaged*”. We now aim to explore the 
topological lasing aspect of these experiments further, by comparing 
topological edge-mode lasing with lasing from a trivial edge mode in, 
for example, a Semenoff insulator. Such experiments would also link 
our exciton-polariton platform to the recently observed topological 
insulator laser*!*”. Because of the interacting nature of polaritons, 
in-depth study of collective bosonic effects in topological insulators 
can be envisaged. For example, the large nonlinearity displayed by this 
exciton-polariton topological system can support the observation of 
solitons in topological insulators, which have been proposed**# but 
not yet observed in any system. This work is a step towards new topo- 
logical polaritonic devices with properties and functionalities involving 
nonlinearity, gain, interactions and coherence. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
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METHODS 


Basic sample characterization. Honeycomb lattices of coupled micropillars 
were etched into a planar semiconductor Fabry-Pérot microcavity, grown 
by molecular beam epitaxy. The cavity consists of a GaAs \-cavity equipped 
with three 16-nm-wide Ino 94Gao.9sAs quantum wells sandwiched between 
two distributed Bragg reflectors with 30 (35.5) Alo.19Gao 90As/AlAs top (bottom) 
mirror pairs. The quantum wells were placed at the maximum of the electromag- 
netic field, resulting in a strong exciton-photon coupling with a Rabi splitting of 
4.3 meV (see Extended Data Fig. 1). The set of quantum wells emit with an exciton 
linewidth of yx= 1.2 meV (full width at half maximum), measured with the top 
mirror etched away. The quality factor of the cavity was determined experimentally 
by measuring the mode linewidth of a single large pillar at a negative (photonic) 
detuning, 6 ~ —11.5 meV, to be about 13.000 (yc=0.11 meV). The overall layer 
thicknesses decrease radially towards the outside of the wafer, affecting mainly 
the photonic mode. The decrease in thickness allows the experimenter to choose 
a certain exciton-to-photon detuning. By increasing the asymmetry between the 
cavity mode and the stopband centre, the TE-TM splitting of the cavity mode 
is increased, mimicking a spin-orbit interaction*®. However, the further the 
cavity mode is moved towards the edge of the stopband, the lower the reflec- 
tivity becomes, implying a lower Q-factor and larger polariton linewidth. Based 
on simulations and experience from previously grown samples, a DBR (cavity) 
asymmetry factor of 1.03 (0.91) was chosen for the microcavities presented in 
this work (1.00 (1.00) would be the symmetric case), yielding a TE-TM splitting 
of approximately 600 j1eV at kj = 2.0m! (see Extended Data Fig. 2). Both the 
indium content in the InGaAs quantum well and the quantum-well thickness 
were optimized with regards to the smallest linewidth and largest Zeeman split- 
ting. 4% indium and 16 nm thickness yield a Zeeman splitting AEz = 540 jeV 
at B=5 T (see Extended Data Fig. 2). After sample growth, a honeycomb lattice of 
approximately (120 x 120) um? with a pillar diameter of d= 2.0m and an over- 
lap v=a/d=0.85 was defined using an electron beam lithography process and 
subsequent wet etching. Extended Data Fig. 3 shows a microscope image of an 
intended defect in the form of a site missing in a zigzag edge of the honeycomb 
lattice (indicated by the red arrow). 

Photoluminescence experiment. Non-resonant photoluminescence experi- 
ments with and without an applied magnetic field were carried out using a lin- 
early polarized, pulsed titanium-sapphire laser with a repetition rate of 82 MHz 
and a pulse length of approximately 2 ps. The wavelength of the laser was set to 
be A, =792 nm, coinciding with the first high-energy stopband minimum of the 
microcavity structure. 

The emission was collected using a microscope objective (20x, NA =0.4) and 
imaged at the entrance slit of a Czerny-Turner spectrometer, equipped with a 
charge-coupled device (CCD) camera with a resolution of about 20 j1eV. A motor- 
ized imaging lens allows for automated hyperspectral images ((k,, ky) versus E) and 
mode tomographies ((x, y) versus E). 

The sample was mounted in a liquid helium flow cryostat (Oxford Instruments 
Microstat), operating at a temperature of T=4 K. Using superconducting coils, a 
magnetic field B=—5 T to +5 T can be applied in Faraday geometry. 
Bloch-mode calculations for polariton honeycomb lattices. A widely accepted 
model describes the dynamics of excitons with spin-up (7) and spin-down (w") 
coupled to cavity photons carrying the right (E*) and the left (E~) circular polar- 
izations, respectively'®**°, and is governed by 


he 


ihO,E* = | -——V?. +. V(r) + wei |E* 
2mc (1) 
+ B(O, Fid,)7E* + hQgy* +E, *P 
: + 4 Ap |.) + + 
iho = a, Wy +hQRE (2) 


(where 0; represents partial derivative with respect to time ¢, and E, is the external 
coherent radiation explained below). Here, the normalization is such that |E*|? and 
|W*|? are the number of particles per unit area. The quantities we and wy represent 
the energies of bare photons and excitons, respectively. In the present configu- 
ration, the photon-exciton detuning is negative, 6= (wc — wg) = —6 meV. The 
photon-exciton coupling strength is given by the parameter Qa, which defines the 
Rabi splitting as 2HOg = 4.5 meV. Here, mc= 32.3 x 10° °me is the effective photon 
mass in the planar region, and m, is the free electron mass. The effective mass of 
excitons is mg 10°mc. An external photonic potential V(r) is defined within the 
unit cell of the honeycomb structure, constructed of circular mesas (micropillars). 
We assume that the potential is V(r) =30 meV outside the mesas and zero other- 
wise. In what follows, we assume that the intensity of polaritons is weak enough 
and thus neglect nonlinear interactions between them. The TE-TM splitting of 
the cavity modes gives rise to the linear coupling between right- and left- circular 
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polarizations and is denoted by (ref. *°). We account for the magnetic field via 
Zeeman splitting Ag of the excitonic states in the quantum wells. 

Equations (1) and (2) allow for accurate simulation of the propagation of 
the chiral modes coherently injected into the system by means of the external 
coherent radiation (E+), with an appropriately chosen frequency (wp) within the 
topological gap. First, we calculate the energy-momentum band structure of the 
honeycomb lattices, using a full description of the Bloch modes that takes into 
account all relevant system parameters. For this aim, we solve the following eigen- 
value problem for the energy hju(ky) of the Bloch mode with the Bloch vector 
ky = (kpxs Kpy) 


ay ,k wae. - pk + ,k 
mudeg| 2 OPP a|E Pash he (3) 
PL(ok)} (CL Jp, oy) 


The circularly polarized polaritonic wavefunctions are p*(r,k,)= 
(cg (r, ky), xg (1, k,)), where the functions ¢;(r, k,) and x;(r, k,) describe the 
amplitude distributions of the photonic and excitonic component of the Bloch 
modes in real space, defined in the plane of the microcavity r= (x, y). The diagonal 
of the matrix in equation (3) describes the single-particle coupled states of excitons 
and photons and is given by the expression 


2 
we+ V(r)— A (V, + ik,)? hQz 
at 2mMc 
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h? ag es 
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at 
The coupling between both polarization components is given by the matrices C 


Ct _ |AIM, + ky) -(e, Fie)? 0 
0 0 


(5) 


which includes the TE-TM splitting of the photonic modes. Ag describes 
the Zeeman splitting of the excitons due to the applied magnetic field. The unit 
vectors ey, shows the directions of Cartesian coordinates (x, y) in the plane of the 
microcavity. To reduce computational efforts greatly, it is convenient to solve the 
eigenvalue problem in polaritonic basis after diagonalization of the matrix L, 
describing the coupling between photons and excitons. In this case, the effective 
parameter for the TE-TM splitting Ge G|C|? and the Zeeman splitting 
Aegp* Apg|X|? are scaled with the photonic and excitonic components of the polar- 
itons, respectively (where |C|? and |X/? are the Hopfield coefficients, calculated for 
the respective Bloch mode). 

Physically, the excitation occurs by coherent illumination with linearly 
polarized light at the frequency within the topological gap. The seeding pulse 
duration is about 40 ps. Other parameters: Ap = +0.8 meV, 3=0.20 meV jum?, 
Ger=0.15 meV um’, y=0.01 meV, polariton lifetime T= 35 ps, pillar diameter 
d=2.0,m, centre-to-centre separations a= 21m. 

Polariton chiral edge mode propagating around a corner and defect. Using the 
Bloch mode calculations described above, we now excite the system resonantly 
and calculate its evolution in time for a Zeeman splitting of Ag = +0.8 meV. The 
polaritons propagate along the edge and around the 90° corner as expected for a 
topological edge mode (see Extended Data Fig. 4). In the same way, we find that 
the edge mode avoids an artificial defect in the form of a site missing in a zigzag 
chain (see Extended Data Fig. 5). 

Tight-binding model. A tight-binding model describing the artificial graphene 
band structure is given by 


E,c(k) =E,+t,./3+ fk — t'f(kp (6) 


3 


f(ky) = 2cos(/3 kya) + scos| a cos [Fe (7) 


Here, t is the nearest neighbour, ¢’ is the next-nearest neighbour and ky is the meas- 
ured in-plane lattice vector, which is linked to the measured polariton emission 
angle 0 by ky =(w/c)sin@. The resulting band structure of this model is presented 
as a fit to the data in Fig. 2f, in red and yellow-brown. 

Gap measurement. When an external magnetic field is applied to this lattice, a 
bandgap is predicted to open at the Dirac points. We make use of a \/4 polar- 
ization series to detect this gap. In Extended Data Fig. 6, exemplary images of 
the \/4-series at external magnetic fields of B=0 T and 5 T are presented. The 
energetic position of the Dirac point was evaluated by fitting a Lorentzian peak 
profile to the line spectrum through the Dirac point at K’. The peak positions 
were plotted against the angle of the \/4-waveplate. Assuming that the position of 
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the peak corresponds linearly to its polarization, the resulting graph can be fitted 
using the equation 


(¢) = ls + S,cos*(2#) + S,sin(24)cos(24) + S,sin(2¢)] 


normally used to fit the peak intensity in a \/4-series with a constant peak position. 
At B=+5 T, a bandgap of E, = 108 + 32\1eV was evaluated. The peak movement 
with no external magnetic field applied can be attributed to imperfections of the 
\/4-waveplate. It was used to indicate the uncertainty of the bandgap that opens 
when an external magnetic field is applied. 
Input/output characteristics and linewidth. By increasing the power of the 
non-resonant pump laser, measuring the intensity and linewidth we determine 
the threshold characteristic of the polaritons as shown in Extended Data Fig. 7. 
Ata typical threshold of Pi, + 1.8 mW a distinct nonlinear increase in intensity as 
well as a sudden decrease in linewidth can be observed. The mode associated with 
the gap mode becomes visible around P= Pip. 
Ginzburg-Landau calculations of polariton condensation into a topological 
edge mode. In theoretical proposals such as in refs !*!8, no particular form of 
excitation was considered explicitly. It is clear from the theoretical Bloch mode cal- 
culations that a resonant excitation is able to directly populate the various modes. 
For the case of non-resonant excitation, further theoretical work is needed to 
access the physics of polariton condensation into a chiral topological edge mode, 
described in Fig. 3. 

Here we use the driven-dissipative Ginzburg-Landau model, frequently used 
to describe the spatial form of polariton condensates: 


‘ A a 7 
Sy [Lo + ily? yg ik)” 
noi 2 + 7 - (* (8) 
wae et ikyy? Lp —ibw "|" 


Here yw. and w_ denote the two spin components of the two-dimensional polari- 
ton wavefunction. The operator Le is defined as Te = Ey + V(r) +iW(r), where 
the operator E, corresponds to the bare kinetic energy of polaritons, modelled with 
an effective mass m. The potential V(r) accounts for the honeycomb lattice struc- 
ture and Ag¢ is the Zeeman splitting caused by the applied magnetic field. The two 
spin components are coupled by the off-diagonal spin-orbit coupling term, phys- 
ically corresponding to a TE-TM splitting with strength Gor. 

The linear gain and loss in the system is described by the term W(r) = 
P(r) — I(r), where P(r) represents the spatially dependent non-resonant pumping 
and I(r) is a spatially dependent dissipation rate. We assume that the loss is higher 
outside the micropillar regions. For the system to form a steady state, it is impor- 
tant to consider also nonlinear loss terms. For simplicity, we neglect the effect of 
polariton-polariton interactions, which is valid provided we operate not too far 
above the condensation threshold. 


Before solving the full nonlinear problem, it is instructive to consider the spec- 
trum of linearized modes, obtained by neglecting the nonlinear term. Considering 
a strip geometry, where the potential is periodic in the x direction and bounded 
in the orthogonal y direction, the Bloch theory can be used to obtain the complex 
band structure of the system eigenmodes with respect to a wavevector k,. The 
result is shown in Extended Data Fig. 8a. For parameters comparable with the 
experiment, we obtain a topological bandgap bridged by chiral edge states. A slight 
asymmetry in the edge states occurs owing to the pumping P(r), which is taken 
to be periodic in the x direction (to maintain validity of the Bloch theory) but 
preferentially localized near the bottom edge of the strip. 

Extended Data Fig. 8b shows the imaginary parts of the same eigenmodes. The 
chiral edge state is found to have the largest imaginary component, meaning that 
it has a larger gain than other states and would be preferentially selected during 
polariton condensation. However, it should be noted that in principle there are 
an infinite number of energy states obtainable with the present theory, and it is 
necessary in practice to introduce an energy cut-off to solve the Bloch Hamiltonian. 

To rule out the potential population of higher energy states, we rely on solution 

of the full nonlinear problem, propagating equation (8) in time to a steady state. 
Here we do indeed find condensation in a chiral edge state, as illustrated with the 
colour scale in Extended Data Fig. 8a. A slight shift from the results of the linear 
band-structure calculation occurs because of the nonlinear term. Extended Data 
Fig. 8c shows the obtained wavefunction in real space, which is indeed localized 
at the edge excited by the pumping, agreeing with the experimental findings. In 
addition to explaining our experimental observation of chiral current under non- 
resonant pumping, these theoretical results predict a polariton lasing in a topolog- 
ical edge state: a topological polariton laser?!*. However, we point out that a strict 
comparison between a topological laser and a non-topological laser (as in ref. *”) 
cannot easily be made, as the topological gap cannot be closed without changing 
a wide range of other system parameters. 
Real-space mode tomographies at B= 0 T and B= —5 T. For the sake of com- 
pleteness, we perform the same mode tomographies at B=0 T and B= —5 T as 
were displayed in Fig. 3 for B= +5 T. The results are displayed in Extended Data 
Fig. 9a-d (B=0 T) and Extended Data Fig. 9e—h (B= —5 T). Whereas for the 
B=0 T case no distinct edge mode is observed, the behaviour for B= —5 T is 
qualitatively similar to that for B= +5 T, as expected. 


Data availability 

The research data that support this publication can be accessed at https://doi. 
org/10.17630/4a62cbdd-bcae-45d7-a556-3cda53c0a656. Additional data related 
to this paper may be requested from the corresponding authors. 
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Extended Data Fig. 1 | White-light reflectivity measurements as a 
function of the detuning. Reflectivity measurements are shown as a 
function of the detuning. A distinct anticrossing behaviour with a Rabi 
splitting of 2hQx = 4.3 meV can be observed. The measurements were 
performed on a sample piece with approximately 15 mirror pairs removed 
from the top DBR to increase the signal quality. Inset, fitted peak positions 
versus detuning. UP and LP stand for upper polariton and lower polariton. 
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Extended Data Fig. 2 | Zeeman splitting and TE-TM splitting for a experiment. Inset, example of central emission energies for a \/4-series 
III-V microcavity hosting Ino,94Gao,9¢As quantum wells. Left, Zeeman with a sine fit. Right, experimentally determined TE-TM splitting at 
splitting with regard to the magnetic field, including second-order various detunings Ag for sample A including fits with modified photonic 
polynomial fits as a guide to the eye. Sample A is the one used in this Hopfield coefficients (red). 
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Extended Data Fig. 3 | Microscopy image of a zigzag-edge polariton 
honeycomb lattice with an intentional defect. Shown is an image of the 
honeycomb lattice with d= 2.0 zm and v= 0.85, with an intentional defect 
in the zigzag chain, selected for experiments in an external magnetic field. 
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Extended Data Fig. 4 | Polariton chiral edge mode propagating around linearly polarized narrow coherent seeding beam injects both polarization 
a corner. a—c, Propagation dynamics of edge modes injected coherently components into the region marked by the red circle. At t= 100 ps, 
into the topological gap and calculated within model 1. Shown here is the the mode propagates around the corner from the zigzag edge into the 


right-moving propagation for the positive-value splitting Ap = +0.8 meV armchair one. 
(| Acs] =0.2 meV) and G=0.20 meV um? (Bef¢= 0.15 meV pm’). A 
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Extended Data Fig. 5 | Polariton chiral edge mode propagating 

and avoiding a defect. Propagation dynamics of edge modes injected 
coherently into the topological gap and calculated within model 1 

are shown for the right-moving propagation for the positive-value 
splitting Ay = +0.8 meV (|Acr| =0.2 meV) and 3=0.20 meV um? 

(Bere = 0.15 meV m7). A linearly polarized narrow coherent seeding beam 
injects both polarization components into the mesa. At t= 85 ps, the mode 
propagates around the defect in the zigzag chain, marked by the red circle. 
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Extended Data Fig. 6 | Topological gap measurement. A \/4-plate 
measurement at B=0 T (blue) and B= +5 T (red) at the K point yields a 
bandgap of E, = 108 + 32 1eV. PL, photoluminescence. 
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Extended Data Fig. 7 | Input/output characteristics and linewidth 
behaviour as a function of pump power. Below threshold, the gap and 
bulk mode cannot be distinguished. At a typical threshold Pj, + 1.8 mW, 
a distinct nonlinear increase in intensity as well as a sudden decrease in 
linewidth can be observed. Here, the populated gap modes show similar 
behaviour to the bulk mode. 


© 2018 Springer Nature Limited. All rights reserved. 


LETTER 


b C Iwi? 
[| 10.14 
0.12 
iil | 
: i 
E ri E 0.08 
2 > 0.06 
0.04 
0.02 
- 0 ; 5 0 5 
k (um"') k (um*') x (um) 
Extended Data Fig. 8 | Driven-dissipative Gross—Pitaevskii calculation to be populated with increasing pumping. c, Edge state obtained from 
of polariton condensation into topological edge mode. a, Band structure _ solution of the driven-dissipative Gross—Pitaevskii equation. Parameters: 
of polaritons in a honeycomb lattice. The dotted curves represent the Ae=0.3 meV, Berp— 0.2 meV zm’. The effective mass m was taken as 
dispersion of the linear eigenmodes of a strip, colour-coded to represent 1.3 x 10~4 of the free electron mass; the potential of depth 0.5 meV was 
localization on the bottom edge (red), upper edge (green) and in the bulk constructed from a honeycomb lattice of cylinders of radius 1 jum and 
(blue). The shaded region represents the energy and momentum of the centre-to-centre separation 1.7 |1m; the pump spot was taken as a Gaussian 
polariton steady state obtained from solving the driven-dissipative Gross— centred on the strip edge with extent 7.5 1m in the y direction. A spatially 
Pitaevskii equation. b, Imaginary components of the linear eigenmodes. uniform decay rate of 0.2 meV was supplemented with a 1.7 meV decay in 
The largest imaginary part corresponds to an edge state (the colour the region outside the cylinders. 


coding is the same as in a), suggesting that the edge state is most likely 
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Extended Data Fig. 9 | Real-space mode tomographies of a polariton 
condensate at B= 0 T and B= —5 T. a—d, Measurements at B= 0 T. 

a, Real-space spectrum in x direction perpendicular to the zigzag edge 
along the straight white line in d. The real-space x axis is consistent 
between a, b and d. The dashed white line marks the physical edge of 
the lattice. Only a trivial S-band condensate can be observed throughout 
the structure. b, Mode tomography displaying the topologically trivial 
S-band condensate at Es = 1.4673-1.4675 eV. A relatively homogeneous 
condensate within the pump spot diameter of 401m is observed. The 
inset shows a microscopy image of the structure. c, d, Mode tomography 
of the energy Ezdge = 1.4678 eV for comparison at the corner position (c) 
and at the edge (d) of the sample. Without magnetic field, no localized 
edge mode can be observed. e-h, Measurements at B= —5 T (fully 
analogous to Fig. 3d—g). e, Real-space spectrum in the x direction 
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perpendicular to the zigzag edge along the straight white line in h. The 
real-space x axis is consistent between e, f and h. The dashed white line 
marks the physical edge of the lattice. A trivial S-band condensate can be 
observed throughout the structure. At E= 1.4678 eV, again we observe the 
appearance of a localized mode, well separated from the bulk and located 
at the zigzag edge. f, Mode tomography displaying the topologically trivial 
S-band condensate at Es = 1.4673-1.4675 eV. A relatively homogeneous 
condensate within the pump spot diameter of 40 {1m is observed. The inset 
shows a microscopy image of the structure. g, h, Mode tomography of 

the topological edge mode at Ezdge = 1.4678 eV at the corner position (g) 
and at the edge (h) of the sample, showing clearly that the mode extends 
around the corner from the zigzag to the armchair configuration and 
avoids the intentional defect, both without bulk scattering. 
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In-plane anisotropic and ultra-low-loss polaritons 
in a natural van der Waals crystal 


Weiliang Ma!!, Pablo Alonso-Gonzaélez*""*, Shaojuan Li+"", Alexey Y. Nikitin®+, Jian Yuan!, Javier Martin-Sanchez?, 
Javier Taboada-Gutiérrez’, Iban Amenabar®, Peining Li°, Satil Vélez>°, Christopher Tollan°, Zhigao Dai’, Yupeng Zhang’, 
Sharath Sriram®, Kourosh Kalantar-Zadeh’, Shuit-Tong Lee!, Rainer Hillenbrand*>!°* & Qiaoliang Baol”* 


Polaritons—hybrid light-matter excitations—enable nanoscale 
control of light. Particularly large polariton field confinement and 
long lifetimes can be found in graphene and materials consisting of 
two-dimensional layers bound by weak van der Waals forces’? (vdW 
materials). These polaritons can be tuned by electric fields** or by 
material thickness”, leading to applications including nanolasers®, 
tunable infrared and terahertz detectors’, and molecular sensors®. 
Polaritons with anisotropic propagation along the surface of vdW 
materials have been predicted, caused by in-plane anisotropic 
structural and electronic properties’. In such materials, elliptic 
and hyperbolic in-plane polariton dispersion can be expected 
(for example, plasmon polaritons in black phosphorus’), the 
latter leading to an enhanced density of optical states and ray-like 
directional propagation along the surface. However, observation of 
anisotropic polariton propagation in natural materials has so far 
remained elusive. Here we report anisotropic polariton propagation 
along the surface of ~-MoOs, a natural vdW material. By infrared 
nano-imaging and nano-spectroscopy of semiconducting «-MoO; 
flakes and disks, we visualize and verify phonon polaritons with 
elliptic and hyperbolic in-plane dispersion, and with wavelengths 
(up to 60 times smaller than the corresponding photon wavelengths) 
comparable to those of graphene plasmon polaritons and boron 
nitride phonon polaritons* >. From signal oscillations in real-space 
images we measure polariton amplitude lifetimes of 8 picoseconds, 
which is more than ten times larger than that of graphene plasmon 
polaritons at room temperature!®. They are also a factor of about 
four larger than the best values so far reported for phonon polaritons 
in isotopically engineered boron nitride!! and for graphene plasmon 
polaritons at low temperatures’. In-plane anisotropic and ultra- 
low-loss polaritons in vdW materials could enable directional and 
strong light-matter interactions, nanoscale directional energy 
transfer and integrated flat optics in applications ranging from bio- 
sensing to quantum nanophotonics. 

Anisotropic optical materials exhibit numerous distinctive and 
non-intuitive optical phenomena such as negative refraction’, 
hyper-lensing’*, wave-guiding!’ and enhanced quantum radiation’®, 
which have been demonstrated typically with artificial hyperbolic 
metamaterials. However, further progress is limited by optical losses 
and the complexity of metamaterial fabrication’. 

The recent emergence of low-loss vdW materials opens the door 
to achieving anisotropic optical phenomena naturally, because their 
layered crystal structure leads to an intrinsic and strong out-of-plane 
(perpendicular to the layers) optical anisotropy®'®. Prominent exam- 
ples are hyperbolic phonon polaritons (PhPs)—infrared light coupled 
to lattice vibrations in layered polar materials—in hexagonal boron 


nitride (h-BN), which exhibit long lifetimes!’, ultra-slow propagation’? 
and hyper-lensing effects*””!. Interestingly, when the layers of a vdW 
material are anisotropic (that is, when the permittivities along ortho- 
gonal in-plane directions are different), the polaritons are expected to 
propagate along the layers with an in-plane anisotropic dispersion’. 
When the permittivities are different but of the same sign, the polari- 
tons possess an elliptic in-plane dispersion, in which the iso-frequency 
contours (slices in two-dimensional (2D) wavevector space (kx, ky) of 
constant frequency w) describe ellipsoids. When the signs are different, 
the polaritons possess an in-plane hyperbolic dispersion, in which the 
iso-frequency contours are open hyberboloids””. Only recently, PhPs 
with in-plane hyperbolic dispersion have been demonstrated by fabri- 
cating an artificial metamaterial out of h-BN flakes”’. 

Theory predicts polaritons with both in-plane anisotropies even for 
natural materials (without any nanostructuring) that exhibit an in-plane 
anisotropy of their electronic or structural properties: for example, hyper- 
bolic plasmons—light coupled to free carriers—in black phosphorus’ or 
in Weyl semimetals™. While being expected to provide fundamental 
insights into exotic material properties (for example, non-reciprocal 
Purcell enhancement”), they also bear application potential, including 
intrinsically non-reciprocal plasmon guiding”’, topological transitions in 
2D anisotropic plasmons” and directional nanoscale energy collimation”® 
(for use as planar and directional light emitters with on-chip integration). 
However, their experimental observation and verification has so far been 
elusive. Here we present the first (to our knowledge) images of in-plane 
elliptic and hyperbolic polaritons (more precisely, PhPs) that propagate 
with record-length lifetimes. We found them in thin slabs of a-phase 
molybdenum trioxide (a-MoQOs3), a natural vdW polar semiconductor. 
Phonon polaritons have been observed” only recently in a-MoO;, but 
their anisotropic propagation properties have not been described. 

The diagrams in Fig. 1a, b show the orthorhombic crystal struc- 
ture of a-MoO3, in which layers formed by distorted MoOg octahedra 
(Fig. 1a) are weakly bound by vdW forces”® and all three lattice con- 
stants (a, b and c) are different (Fig. 1b). Most importantly, «-MoO; has 
strong in-plane structural anisotropy, caused by the interlayer spacing 
of the (100) facet differing from that of the (001) facet by as much as 
7.2%, which leads to the highly anisotropic response” (Supplementary 
Information). Indeed, the different directional vibrations of the 
a-MoOs crystal structure yield two infrared ‘reststrahlen bands’ (RBs)°” 
between about 820 cm~! and 1,010 cm™!; in this range, the typically 
strong reflectivity between the transverse optical and longitudinal 
optical phonon frequencies (TOs and LOs, respectively) shows a large 
in-plane anisotropy (Supplementary Information). Thus, we can expect 
that in-plane anisotropic PhPs exist in this material. An optical micro- 
scopy image of the a-MoOs flakes and their typical Raman spectrum 
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Fig. 1 | Physical properties of «-MoOs3. a, Illustration of the 
orthorhombic lattice structure of layered «-MoO; (red spheres, oxygen 
atoms). The orthorhombic structure is based on bilayers of distorted MoO. 
octahedra stacked along the [010] direction via vdW interactions. The 
three possible positions of oxygen atoms are denoted Oj_3, and the unit 
cell is shown dashed. b, Schematic of the unit cell of a-MoO3; the lattice 
constants are a= 0.396 nm, b= 1.385 nm and c=0.369 nm. Blue spheres, 


are shown in Fig. lc, d, respectively. The latter shows”? characteristic 
peaks at 820 cm! and 996 cm”! associated with the lattice vibrations 
producing the RBs of a-MoO3. 

To explore the polaritonic response of a-MoOs, we performed polar- 
iton interferometry using scattering-type scanning near-field optical 
microscopy (s-SNOM, Fig. 2a). A vertically oscillating metallized 
atomic force microscopy (AFM) tip is illuminated with p-polarized 
infrared light of frequency w and field E;,. while scanning an a-MoO; 
flake. Acting as an infrared antenna*®, the tip concentrates the incident 
field at its very apex to give a nanoscale infrared spot for local probing 
of material properties and for exciting polaritons. The tip-scattered 
radiation is recorded simultaneously with topography, yielding nano- 
scale resolved near-field images (Methods). Specifically, the polaritons 
(described by the field E and wavelength A) excited by the tip propagate 
away and are back-reflected at the flake edges, giving rise to interference 
fringes with a spacing /2. 

Figure 2b shows s-SNOM near-field amplitude images of an a-MoO3 
flake with thickness d=250 nm taken at w=990 cm™! and 
w=900 cm™', both frequencies residing*’ inside the two RBs of 
o-MoOs. For w=990 cm! (top image) we observe bright fringes par- 
allel to all the flake edges. They strongly resemble PhPs, similar to what 
has been observed in s-SNOM experiments on other polar materials° 
and recently on*’” a-MoO3. We observe that the fringe periodicity 
largely depends on the propagation direction, being \,,=950 nm and 
Ay=1,200 nm for the [100] and [001] crystal directions (Supplementary 
Information), respectively. Apart from the deep subwavelength-scale 
polariton confinement A, ,<)=11.1 jum (where Ao is the wave- 
length of the illuminating infrared light), this finding reveals a strongly 
anisotropic in-plane propagation (along the flake). This anisotropy 
becomes even more marked at w=900 cm7! (Fig. 2b lower image), 
where the fringes are seen only parallel to the [001] direction. 


558 | NATURE | VOL 562 | 25 OCTOBER 2018 


Intensity (a.u.) 


Bi oCe eee 


b = 1.385 nm ! eel 


1 n 1 mt 1 nq 
800 900 1,000 1,100 
Raman shift (cm-') 


0 . n 
600 700 
molybdenum atoms. c, Optical image of «-MoO; flakes. The a-MoO; 
crystals typically appear to be rectangular owing to the anisotropic crystal 
structure. Labelled arrows indicate crystal directions. Scale bar, 20 1m. 

d, Raman spectrum taken in the area marked by a red dashed circle in c. 
Red frequency labels indicate the Raman peaks associated with the lattice 
vibrations producing the RBs of a-MoO3. 


For unambiguous verification of the anisotropic polariton 
propagation, we recorded spectroscopic line scans® (Methods) along 
the [100] and [001] in-plane crystal directions (Fig. 2d, left and 
right panels, respectively). We observe two spectral bands exhibit- 
ing a series of signal maxima (fringes). The band limits (indicated by 
the horizontal dashed lines) correspond to the LO and TO phonon 
frequencies of a-MoO3 (denoted by LO, LO2, TO; and TO2) and 
thus reveal the upper and lower RBs (U-RB and L-RB, respectively). In 
the U-RB we find that the fringe spacing (corresponding to the polar- 
iton wavelength) along both the [100] and [001] directions increases 
with increasing frequency, indicating a negative phase velocity (anal- 
ogous to PhPs in the lower type-I RB of h-BN"’). As in Fig. 2b, we 
observe a slightly different fringe spacing for the [100] and [001] 
directions, but now for all frequencies between TO) and LO>. A very 
different behaviour is observed for the L-RB. Along the [100] direction 
we see fringes whose spacing decreases with increasing frequency, 
manifesting polaritons with positive phase velocity. More importantly, 
along the [001] direction we do not observe signal oscillations at a 
fixed frequency for the whole spectral range between TO, and LO}. 
This finding indicates the absence of PhPs propagating in the [001] 
direction, supporting our assumption of a hyperbolic in-plane dis- 
persion. The horizontal fringes observed in Fig. 2d (right panel) are 
caused by polaritons propagating along the [100] direction. Note that 
a line profile for a fixed w corresponds to a vertical line profile (along 
the [001] direction, and thus parallel to the interference fringes) in the 
lower panel of Fig. 2b, where we can see that PhPs are launched by the 
left edge of the flake. Depending on w and on the distance between 
the tip and the left flake edge, we thus observe either a constantly bright 
or dark contrast when the tip is scanned along the [001] direction, 
corresponding to a bright or dark horizontal fringe in the right panel 
of Fig. 2d. 
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Fig. 2 | Real-space imaging and nano-spectroscopy of an a-MoQ; flake. 
a, Schematic of the s-SNOM experimental configuration used to image an 
a-Mo0O; flake. A metallized AFM tip (yellow) is illuminated by 
p-polarized infrared light of frequency w and electric field Enc. It launches 
polaritons, which are back-reflected at the flake edges and subsequently 
scattered by the tip. The tip-scattered field E,., is detected by a distant 
detector. b, Near-field amplitude images s4 (Methods) of an a-MoO; flake 
with thickness d= 250 nm at illuminating frequencies w= 990 cm“! (top 
panel) and w =900 cm! (bottom panel). Scale bars, 2 um. c, Dispersion 
of PhPs along the [100] and [001] directions in the U-RB (top panel) and 


For a better understanding and quantitative analysis of the aniso- 
tropic polariton propagation, we extracted the PhP dispersions, w(k;) 
(i=x, y), from monochromatic s-SNOM images (not shown) of the 
flake in Fig. 2b. The dispersions for both crystal directions and RBs are 
plotted in Fig. 2c. For the U-RB (upper panel), the PhP dispersions 
along both crystal directions are similar, although slightly separated 
from each other (that is, for the same frequency w, we measured differ- 
ent wavevectors k;). This result verifies that PhPs in the U-RB propagate 
with in-plane anisotropy. By plotting the complex-valued wavevector 
of the PhPs, we find that their phase velocity, vp,;= w/k;, is negative 
along both directions, which is indicated by negative k; values. 
Furthermore, the remarkably small slopes of the dispersion curves 
(Supplementary Information) yield unprecedentedly small group veloc- 
ities (v, ,=(Ok,/Ow)~') of about 0.8 x 107%c (at w=985 cm™'), which 
in the future could be exploited for strong light-matter interaction 
experiments*!. For the L-RB (Fig. 2c lower panel), we only display the 
dispersion of the PhPs for the [100] direction, as no PhPs are observed 
in the orthogonal [001] direction. In this case, the phase velocity is 
positive (indicated by positive k, values), and the group velocity is about 
0.7 x 10~*c (at w= 893 cm~), which is comparable to that of ultra-slow 
PhPs in h-BN". 

To quantify the anisotropy of the PhPs and to measure their iso- 
frequency contours in wavevector space, we analyse PhP propagation 
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L-RB (bottom panel, see text). Grey lines in both panels are guides to 

the eye. Grey shaded areas indicate the spectral regions outside the RBs. 

d, Bottom row, nano-FTIR spectral line scans along [100] and [001] 
(directions shown as arrows in the bottom panel of b), showing the 
near-field amplitude s) (normalized to the near-field amplitude on the 
SiO, substrate, s 2,810) as a function of distance between tip and flake edge. 
Dotted lines mark the approximate longitudinal and transversal phonon 
modes in a-MoO3 (TO), 820 cm~!; LOy/TO:, 963 cm~!; LOs, 1,003 cm~!). 
Top row, zooms into the boxed areas of the U-RB shown in the bottom 
row. 


along all possible directions on the flake. To that end, we fabricated 
disks of a-MoO3 (Methods) and performed polariton interferometry 
experiments (Supplementary Information) analogous to those reported 
in Fig. 2. Figure 3a, b shows typical near-field amplitude images taken 
at frequencies respectively in the U-RB (w= 983 cm!) and in the 
L-RB (w=893 cm!) of «-MoOs. In the U-RB, the interference pat- 
tern shows an elliptical shape with the largest PhP wavelength along 
the [001] surface direction, which continuously reduces to its smallest 
value along the orthogonal [100] surface direction. More strikingly, 
in the L-RB the interference pattern manifests as an almond shape, in 
which the PhPs have their largest wavelength along the [100] direction 
and which continuously reduces to zero until no discernible polariton 
propagation occurs along the orthogonal [001] direction. By Fourier 
transform of Fig. 3a, b, we obtain the iso-frequency contours directly. 
We find an ellipsoid in the U-RB (Fig. 3c) and a hyperbola in the L-RB 
(Fig. 3d), revealing that the PhPs exhibit elliptic and hyperbolic disper- 
sions, respectively. Note that Fig. 3c shows two ellipses instead of one, 
differing by a factor of 2 in their semi-axes. We attribute this observa- 
tion to the presence of both tip- and edge-launched PhPs*”*? in Fig. 3a 
(Supplementary Information). On the other hand, the hyperbola in 
Fig. 3d opens along the [001] direction, which indicates that PhPs along 
this crystal direction are forbidden, thus explaining the observations 
in Figs. 2, 3b. 
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Fig. 3 | In-plane elliptical and hyperbolic PhPs in an «-MoO; disk. 

a, b, Near-field amplitude images sq of an «-MoO; disk (colour key at 
right) with d= 144 nm. The imaging frequencies are w= 983 cm! 

(U-RB; a), and 893 cm™! (L-RB; b). Dashed white lines indicate the 

[100] and [001] surface directions. Scale bars, 21m. c, d, Absolute value 

of the Fourier transform |S,(k,,ky)| of the near-field images in a and b, 
respectively (colour key at right), revealing the iso-frequency contours for 
each RB. Solid lines show the iso-frequency contours of the PhPs obtained 
by fitting equation (1) for each case (note that they correspond to 2k). Scale 
bars are 50kp and 20kp, for the U-RB and L-RB respectively, with ko being 
the momentum of light in free space. e, f, Calculated near-field amplitude 
images |E,(x,y)| (Supplementary Information) for an «-MoO; disk (colour 
key at right) at w= 983 cm~! (U-RB; e) and 893 cm! (L-RB; f). Scale bars, 
2m. 


To corroborate our experimental results theoretically, and to extract 
the as-yet unknown anisotropic permittivity of a-MoO3, we model the 
a-MoOs flake as a 2D conductivity layer of zero thickness (Methods). 
We find the following dispersion relation for polaritons in a thin 
in-plane anisotropic slab surrounded by two dielectric half-spaces 
with isotropic permittivities ¢, and €2 (Supplementary Information): 
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where k,, and k,, = fe,,kj—kz—k; are the in- and out-of-plane 
wavevectors, respectively, ko =27/Xp is the wavevector in free space, 
and d= 276,,/c is the normalized conductivity, introduced for con- 
venience. Using equation (1) with a,, and ayy as fitting parameters, we 
obtain excellent agreement with the elliptical and hyperbolic features 
in Fig. 3c, d, where our fits are shown as white solid lines. Neglecting 
absorption, we find a,,.= —0.12i (€,.=2.6) and ay, = —0.16i (€,,=3.7) 
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for w=983 cm! (U-RB), and ay,=0.26i (€,,=—6.4) and Ay = —0.07i 
(€yy = 1.7) for w= 893 cm! (L-RB). 

We corroborate the model and permittivity values by numerical sim- 
ulation of near-field images of an a-MoO; disk on SiO (Supplementary 
Information). We used the nominal experimental values of 144 nm 
and 6\1m for the disk thickness and diameter, respectively, and the 
anisotropic real-valued permittivities obtained from the fit described 
above. The imaginary parts of the permittivities and the value of €,, 
(not obtained from the fit) were adjusted to obtain the best matching 
of the experimental images and of the sign of the phase velocities in 
each RB. As a result of our analysis, we find €,. >0, Ey, > 0 and €z,<0 
for the elliptic U-RB, and €,, <0, ey, > 0 and €,,>0 for the hyperbolic 
L-RB (Supplementary Information). The simulated polariton inter- 
ferometry amplitude images are shown in Fig. 3e, f. Their excellent 
agreement with the experiments (Fig. 3a, b) validates both the model 
and the permittivity values. The results demonstrate that experimental 
PhP interferometry of «-MoO; disks and fitting of the results with our 
simple theoretical model allow the highly anisotropic local permittiv- 
ities of «-MoO; to be extracted. 

The conductivity tensor 6,,—and thus the wavevectors of the PhPs— 
depends on the slab thickness d. According to the relation between Gur 
and é (see Methods), we obtain from equation (1) the thickness- 
dependent anisotropic in-plane polariton wavevectors (Supplementary 
Information): 


—-——,1=x,y (2) 


In Fig. 4a, we demonstrate the thickness tunability of in-plane hyper- 
bolic polaritons: we plot the PhP dispersions obtained by s-SNOM 
nano-imaging along the [100] crystal direction of «-MoO; flakes with 
different thickness d. We clearly observe that the wavevector k, and 
thus the polariton confinement increase with decreasing thickness. For 
d=55 nm, we find k, values of about 3.5 x 10° cm7!, corresponding 
to a PhP wavelength of 180 nm. This value is 60 times smaller than 
Ao= 10.8 um, suggesting that in-plane anisotropic propagation could 
be well paired with deep subwavelength-scale field confinement for 
the development of ultra-compact devices. The inverse dependence 
of k, on d is better observed in Fig. 4b, where we plot the experimen- 
tal k, (red dots) obtained at w=902 cm™! for four flakes of different 
thickness. These experimental values are well matched by our equation 
(2) (grey curve), where we used €,,=—5.1 as extracted for the flake 
with d= 144 nm in Fig. 3, thus strongly supporting the validity of our 
approximation. 

A key property of polaritons for future applications is their life- 
time!!! To measure it, we fitted s-SNOM amplitude line profiles 
along the [100] direction (blue and red crosses in Fig. 4c) with an 
exponentially decaying sine-wave function corrected by the geomet- 
rical spreading factor ./x (Supplementary Information)!°. From the 
amplitude decay length L, (one of the fitting parameters) we obtain 
the lifetime according to T,=L,/vg, where the group velocity vz is taken 
from Fig. 2c. For the in-plane hyperbolic PhPs we obtain 
Tx = 1.9 +0.3 ps, which reveals the ultra-low-loss character of these 
polaritons. Surprisingly, for the in-plane elliptic PhPs we obtain 
T,=8+1 ps (four times higher than that of PhPs in isotropically 
enriched h-BN''). On some flakes we find lifetimes up to 22 ps 
(Supplementary Information). We note that in contrast to low-loss 
h-BN PhPs!! and graphene plasmons’’, a rather small number of 
fringes were observed on a-MoO; flakes. This can be explained by the 
small group velocities of the MoO; PhPs, which yield relatively short 
propagation lengths. The ultra-long PhP lifetimes are corroborated by 
the ultra-narrow linewidths of the a-MoO; Raman peaks 
(Supplementary Information) at 996 cm~! and 820 cm“! (correspond- 
ing to anisotropic bond stretching modes* that originate in the U-RB 
and L-RB, respectively), revealing very high crystal quality. A similar 
relation has been recently reported to explain the large lifetimes 
observed in isotopically enriched h-BN"’. 
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Fig. 4 | Thickness tunability and lifetime of in-plane hyperbolic and 
elliptic PhPs in a-MoO3. a, Experimental (dots) PhP dispersions along 
the [100] direction in a-MoO; for a varying flake thickness d (lines 

are guides to the eyes). b, Experimental (dots) and calculated (line) 
dependence of k, upon d. c, s-SNOM line traces (showing the real part 
of the complex-valued s-SNOM signal o4; Methods) along the [100] 
direction of the flake shown in Fig. 2b with d= 250 nm in the elliptic 
(blue crosses, w = 990 cm) and hyperbolic (red crosses, w= 930 cm~!) 
regimes. Damped sine-wave functions (black solid lines) were fitted 

to the data that correspond to edge-launched PhPs (Supplementary 
Information). Amplitude lifetimes 7,=8 + 1 ps and 7,=1.9+0.3 ps are 
obtained for the U-RB and L-RB, respectively. 


In-plane anisotropic a-MoO3; PhPs add anew member to the grow- 
ing list of polaritons in vdW materials. In combination with external 
stimuli, such as strain, electric gating or photo-injection of carriers, we 
envisage active tuning of the anisotropic PhP properties. Our findings 
may thus establish a route to directional control of light and light- 
matter interactions at the nanoscale. 
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Any methods, additional references, Nature Research reporting summaries, source 
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METHODS 


s-SNOM and nano-FTIR set-up. For infrared nano-imaging we used a scattering- 
type scanning near-field optical microscope (s-SNOM*-°, from Neaspec). 
Metallized, cantilevered atomic force microscope (AFM) tips are used as scattering 
near-field probes. The tip is oscillating vertically at the mechanical resonant fre- 
quency (around 270 kHz) of the cantilever, with an amplitude of about 50 nm. The 
tip is illuminated with p-polarized infrared light of frequency w (from tunable CO, 
and quantum cascade lasers) and electric Ein,, while the a-MoO; flake is raster- 
scanned below the oscillating tip. Acting as an infrared antenna, the Pt-coated tip 
concentrates the incident field into a nanoscale spot at the apex, which interacts 
with the sample surface and thus modifies the tip-scattered field Egsca. Esca is 
recorded with a pseudo-heterodyne Michelson interferometer**. Demodulation 
of the interferometric detector signal at the nth harmonics of the tip oscillation 
frequency yields the complex-valued near-field signals 7, = s,e'*", with s, being 
the near-field amplitude and vp, being the near-field phase. By recording the near- 
field signals as a function of the lateral tip position, we obtain near-field images or 
line trace. In the particular case of probing a material supporting polaritons, the 
nanoscale ‘hotspot at the tip apex acts as a local source of polaritons*°. The tip- 
launched polaritons reflect at the flake edges and produce polariton interference, 
yielding fringes in the near-field images (Figs. 2b, 3a, b). The distance between the 
interference fringes corresponds to half the polariton wavelength, \/2. 

For nano-FTIR spectroscopy*®, the tip was illuminated by a broadband 
super-continuum laser, and the tip-scattered light was recorded with an asym- 
metric Fourier transform spectrometer. By recording point spectra as a function 
of the tip position, we obtained high-resolution spectral line scans°. 

Disk fabrication. Bulk MoO; crystals were grown via chemical vapour deposi- 
tion. Commercial MoO; powder (Sigma-Aldrich) was evaporated in a horizontal 
tube furnace at 785°C and was re-deposited as «-MoO3 crystals at 560°C. The 
deposition process was carried out in an inert environment (Ar flow of 200 sccm) 
at 1 torr*’. The as-grown bulk crystals were then mechanically exfoliated and 
transferred onto a Si/SiO2 (thickness 300 nm) substrate. The transferred flakes 
were inspected with an optical microscope and characterized via AFM, allowing 
the selection of large and homogeneous pieces with the desired thickness. The 
selected flake was then shaped into a disk by using focused Ga-ion beam milling 
in a FEI Helios 600 Nanolab dual beam system. In order to protect the surface of 
the disk from the implantation of Ga ions, the flake was first covered by placing a 
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thin diamond shield (approximate dimensions 100 1m x 80}1m x 0.5 jum) on top 
of the flake using the tip of an Omniprobe micromanipulator. Using the ion beam, 
we then milled through both the diamond shield and the flake using a ring-shaped 
milling pattern, until we reached the substrate. The remaining parts of the diamond 
shield were then lifted off the surface using the Omniprobe micromanipulator to 
give only the disk-shaped flake separated from the bulk flake by a ring-shaped 
channel. 

Conductivity model for MoO; layers. Modelling the a-MoO; flake as a 2D con- 
ductivity layer of zero thickness avoids the calculation of the fields inside the slab”, 
and has been proven valid for in-plane isotropic 2D materials (for example, 
graphene” and transition layer polaritons*”) with a layer thickness that is much 
smaller than the polariton wavelength (d< ). In the model, the effective con- 
ductivity for the isotropic layer is given by oeg= [cd/(2iAo) le, where ¢ is the 
in-plane isotropic permittivity (both ¢ and ogg are scalars). Note that oer scales 
linearly with d, thus taking into account the effect of the small slab thickness. 
Analogously, we model the «-MoQ; layer by an anisotropic in-plane conducting 
layer with zero thickness and an effective 2D conductivity tensor, G4. The gener- 
alized relation between the tensor dj and the (2 x 2) permittivity tensor 
é= diag(€,,,, €)y) is then given by d¢ = (cd/2iA9)é. Note that the model is inde- 
pendent of the out-of-plane permittivity component ¢,,, which subsequently does 
not enter into equation (1). 


Data availability 

All the data are available in the online version of the paper. The data that support 
the findings of this study are available from the corresponding authors on rea- 
sonable request. 
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A protein functionalization platform based on 
selective reactions at methionine residues 


Michael T. Taylor}, Jennifer E. Nelson!, Marcos G. Suero! & Matthew J. Gaunt! 


Nature has a remarkable ability to carry out site-selective post- 
translational modification of proteins, therefore enabling a marked 
increase in their functional diversity’. Inspired by this, chemical 
tools have been developed for the synthetic manipulation of 
protein structure and function, and have become essential to the 
continued advancement of chemical biology, molecular biology 
and medicine. However, the number of chemical transformations 
that are suitable for effective protein functionalization is limited, 
because the stringent demands inherent to biological systems 
preclude the applicability of many potential processes”. These 
chemical transformations often need to be selective at a single site 
on a protein, proceed with very fast reaction rates, operate under 
biologically ambient conditions and should provide homogeneous 
products with near-perfect conversion?-’. Although many 
bioconjugation methods exist at cysteine, lysine and tyrosine, 
a method targeting a less-explored amino acid would considerably 
expand the protein functionalization toolbox. Here we report the 
development of a multifaceted approach to protein functionalization 
based on chemoselective labelling at methionine residues. By 
exploiting the electrophilic reactivity of a bespoke hypervalent 
iodine reagent, the S-Me group in the side chain of methionine can 
be targeted. The bioconjugation reaction is fast, selective, operates 
at low-micromolar concentrations and is complementary to existing 
bioconjugation strategies. Moreover, it produces a protein conjugate 
that is itself a high-energy intermediate with reactive properties and 
can serve as a platform for the development of secondary, visible- 
light-mediated bioorthogonal protein functionalization processes. 
The merger of these approaches provides a versatile platform for the 
development of distinct transformations that deliver information- 
rich protein conjugates directly from the native biomacromolecules. 

The sheer structural diversity of the proteome in any single organism 
means that no one protein functionalization method is likely to pro- 
vide a universal solution for the preparation of protein constructs*"!! 
(Fig. 1a). Although encoded by the AUG start codon at the beginning 
of protein synthesis, methionine is often post-translationally excised 
and thus has a low abundance in proteins (around 2%). It is also 
frequently used as a replacement for hydrocarbon-containing 
residues'*. The limited function of methionine, being mainly 
responsible for protection against oxidative stress, compared to other 
residues means that its functionalization is less likely to impair protein 
function!?, Targeting methionine would not only provide a distinct 
bioconjugation approach but also, using our strategy, the resulting 
methionine conjugates would provide exploitable intrinsic reactivity 
such that they could lead to the rapid synthesis of diverse, functional 
protein constructs from native proteins. 

So far, there has been only one effective method reported for bio- 
conjugation at methionine, in which the oxidation of thioethers with 
oxaziridine reagents provided the basis for an elegant bio-inspired 
strategy to form stable protein-bound sulfoximines'*. We aimed to 
target the polarizable thioether on the methionine side chain with a 
suitable electrophile to form a cationic sulfonium species, selectively 
installing a versatile payload and distinct functionality at a methionine 


residue, thereby providing a fundamentally different bioconjugation 
approach®"'! (Fig. 1a). Methionine residues react with cyanogen 
bromide (CNBr); however, such a process cannot function as a 
bioconjugation method because the instability of the resulting cyano- 
substituted sulfonium cation triggers cleavage of the protein 
backbone!®. Conversely, methionine residues undergo slow S-alkylation 
reactions with iodoacetamide or with other benzyl-derived electrophiles 
to form relatively stable trialkyl sulfonium cations'®'’. The need to 
strike a balance between the stability of the protein-bound sulfonium 
cations, the compatibility of the reaction conditions, and the reaction 
rate of the thioether with suitable electrophiles has, so far, 
precluded the development of an effective alkylation-based method for 
bioconjugation at methionine. Guided by these limitations, we posited 
that a distinct class of electrophile, based on the hypervalent iodine 
scaffold of \?-iodanes”, could make for a functioning bioconjugation 
process at methionine. Tailoring the substituents and counteranion 
on the I(11r) atom should enable us to tune both the reactivity of 
the polarizable I(111) nodal centre, to dovetail with the electron lone 
pair of the thioether, and also the stability of the resulting sulfonium 
conjugate, through modulation of the electronic features of the groups 
directly attached to the cationic sulfur motif. We noted that a struc- 
turally remarkable iodonium salt (1 in Fig. 1, R= Et and X= OTf 
(trifluoromethanesulfonate, also known as triflate)) reacts rapidly with 
dimethylsulfide (the simplest possible mimic of the thioether motif in 
methionine) to form a sulfonium adduct?!?. Successful reaction of 
this iodonium salt with methionine would not only represent a distinct 
method for bioconjugation, but also deliver a high-energy conjugate 
equipped with reactive ‘on-protein’ groups that could serve as a basis 
for designing new transformations towards protein constructs with 
diverse functionality (Fig. 1b). 

We first examined the reaction between dipeptide 2a and iodonium 
triflate 1a (Fig. 2a). We observed the formation of the desired sulfonium 
conjugate 3a, although it was clear from the low yield (27%) 
that the iodonium salt was poorly stable in aqueous solution. By 
tailoring the aryl group of the iodonium salt (to the electron-deficient 
2,4-difluorobenzene) and replacing the triflate counteranion with 
tetrafluoroborate, we found that a readily prepared reagent 1b displayed 
superior physical properties (half-life in water is >50 h). Treatment 
of reagent 1b with dipeptide 2a gave 72% of the desired product 3b 
accompanied by the corresponding sulfoxide (not shown) after reaction 
for 30 min. 

Moving to a more complex substrate, the GLP-1 receptor agonist 
exenatide (Byetta, 2b, a 39-residue helical polypeptide containing 
a single mid-chain methionine, Fig. 2b), we found that treatment 
of a 100 j1M aqueous solution with 1b led to decomposition of the 
polypeptide. A key breakthrough revealed that the addition of a 
low concentration of thiourea (20 mM) resulted in a substantial 
improvement of the reaction, such that sulfonium conjugate 4a was 
formed with 68% conversion in less than 2 min, accompanied by 
non-specific oxidation and labelling. Further improvements could 
be made by performing the reaction in the presence of TEMPO 
((2,2,6,6-tetramethylpiperidin-1-yl)oxyl, 10 mM), which minimized 
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the formation of oxidative by-products, and adding aqueous formic 
acid solution (5 mM, approximately pH 3), which reduced the forma- 
tion of non-specifically labelled by-products to trace levels. Finally, we 
found that the labelling process proceeded effectively when conducted 
in distilled water. Routine analysis of electrospray ionization mass 
spectrometry (ESI-MS) and tandem mass spectrometry (MS/MS) 
fragmentation data confirmed selective reaction at methionine and, 
although the concentration of thiourea is well below the levels needed 
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for protein denaturation, it was confirmed by circular dichroism 
spectra of exenatide conjugate 4a that the characteristic helical structure 
was retained (see Supplementary Fig. 10). Although we are not yet 
certain of the role of thiourea, it is important to note that its presence 
appears to be fundamental in providing a bioconjugation process at 
reaction rates needed for transformation on complex biomolecules. 
With these refined conditions, the thiourea-accelerated bioconjugation 
was almost instantaneous at exenatide concentrations ranging from 
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5-500 1M without compromising the conversion, and a larger, 79% yield of isolated product. Given the similarity of the exenatide 
milligram-scale reaction enabled the purification of conjugate 4a by _ conjugate to the intermediate that would arise from reaction of the 
semi-preparative high-performance liquid chromatography to givea_ _ polypeptide with cyanogen bromide (leading to peptide cleavage), 
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it is remarkable that 4a has a half-life in water of over 100 h. We believe 
that the observed stability of 4a is a result of the ethyl diazoacetate 
motif imparting a lesser electron-withdrawing effect (than the cyano 
group) on the sulfonium salt, which in turn leads to a species that is 
less reactive to attack by the proximal carbonyl group and is, therefore, 
a more stable conjugate. 

The optimized conditions, using 1b, were used to evaluate the 
substrate scope (Fig. 3a). Random coil polypeptides that contain 
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(12:1, o-tol:H transfer) C-benzylation exenatide construct, 18a 


substrate scope. Notably, the reduction of 3¢ proceeds in light, without the 
iridium photocatalyst, but the yield is greatly reduced. c, Secondary protein 
functionalization via photoredox radical cross-coupling between the diazo 
sulfonium-protein conjugate and the C-4 substituted Hantzsch ester. 


methionine, such as aviptadil and tetracosactide, are efficiently 
converted to the sulfonium conjugates 5 and 6 respectively. GTP- 
binding protein fragment Ga, which contains a free sulfhydryl group 
in an N-terminal cysteine residue, underwent smooth methionine 
labelling to 7 with concomitant oxidative disulfide formation. 
Teriparatide, a polypeptide containing two methionines, formed 
bis-sulfonium conjugate (8) with good conversions. a-Lactalbumin, 
a globular protein with a readily oxidized methionine residue, 


© 2018 Springer Nature Limited. All rights reserved. 


undergoes bioconjugation to 9 with minimal competitive oxida- 
tion. Aprotinin (Trasylol) also forms the corresponding sulfonium 
conjugate 10 in good conversion to product. Particularly notable 
is the observation that two cysteine disulfide linkages within the 
structure of aprotinin (and the labelling to 7) are not affected by 
1b, with a single methionine-derived conjugate obtained in high 
conversion. When the methionine residues are buried within the 
tertiary structure of the protein, for example with RNA-ase B, the 
bioconjugation does not occur, highlighting the inherent selec- 
tivity of the labelling process for exposed rather than inaccessible 
thioether groups. This feature is highlighted by the case of ubiquitin: 
the N-terminal methionine residue has only moderate surface expo- 
sure and can slow down the rate of functionalization to the point 
where oxidation becomes competitive. Accordingly, reaction under 
deoxygenated conditions enabled efficient conversion to the labelled 
product (84%), with a 10:1 label:oxidation ratio (11). The methio- 
nine-selective bioconjugation strategy effectively functionalizes a 
range of polypeptides and proteins in high conversion at micromolar 
concentrations and in very short reaction times. Given that 1b is a 
carbon electrophile with two of the best leaving groups known to 
organic chemists, it would be thought to be very reactive; it is there- 
fore remarkable that this species selectively engages the moderately 
nucleophilic methionine residue in the presence of competitively 
nucleophilic and oxidizable amino acid residues. Bioconjugation 
strategies that target other amino acids should, therefore, be compat- 
ible with, and complementary to, our methionine-functionalization 
process. To exemplify this, we showed that the GTP-binding protein 
fragment Ga could be first labelled at cysteine, using a maleimide 
derivative”*, and then conjugated at methionine using iodonium salt 
1b to form 12. The methionine-selective process does not interfere 
with the cysteine-maleimide motif, which itself contains a thioether 
linkage, thereby highlighting possible applications towards multi-site 
heterolabelling of proteins’. 

The modular synthesis of the hypervalent iodine reagents enables 
facile incorporation of different acyl groups attached to the diazo 
motif, allowing the transfer of a range of functional payloads to 
proteins (Fig. 3b). Functional groups relevant to other bioorthogonal 
reactions are readily tolerated in both the reagent synthesis and the 
methionine labelling, smoothly affording sulfonium conjugates 4b and 
4c with 95% and 85% conversion to product, respectively. Biochemical 
reporter groups, such as myristyl- and fluorescein-derived esters 4d 
and 4e, are also readily transferred to exenatide. Notably, we found that 
sulfonium conjugates of exenatide (such as 4f) underwent reaction 
with the tertiary phosphine TCEP (tris(2-carboxyethyl)phosphine), a 
standard biochemical reagent, resulting in the cleavage of the labelling 
group and return of the parent exenatide 2b in >90% conversion” 
(Fig. 3c); the cleavage reaction also works for conjugates 5, 6 and 8 in 
comparable conversions and provides a stimulus-responsive means of 
removing the methionine label. 

Next, we turned our attention to exploring the multifaceted reactivity 
that we anticipated would be intrinsic to the high-energy methionine- 
derived conjugate. The electrophilicity of the diazo sulfonium conju- 
gate 4 prompted us to investigate the single-electron transfer chemistry 
of this reactive motif enabled by visible-light photocatalysis”®”’. 
The addition of a single electron to the diazo sulfonium conjugate 4 
could result in intermediate 15, which, upon cleavage of the C-N> bond, 
would form a putative radical ylide synthon 15’ (Fig. 4a). We visualized 
two pathways through which we could exploit the reactivity of the 
previously unexplored radical ylide. First, combining the radical ylide 
with Hantzsch ester 13* (from 13) may lead to a reduction process 
resulting in the generation of a trialkylsulfonium motif, which 
would impart enhanced chemical stability to the protein conjugate. 
Furthermore, the use of a C-4 benzylated Hantzsch ester derivative 
(14, an established precursor for a benzyl radical)”* to intercept the 
radical ylide species would lead to a C-benzylation product that could 
be used to introduce functional diversity to the protein conjugate. We 
screened a range of photocatalysts under visible-light conditions. When 
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3c was irradiated with a 30 W lamp in the presence of fac-Ir(ppy)3 (ppy, 
2-phenylpyridinato) and the Hantzsch ester 13°”, we observed the 
formation of the reduced trialkylsulfonium product 17a in 95% yield 
(determined by 'H NMR, Fig. 4b). Using these conditions, we showed 
that a range of sulfonium-protein conjugates, including exenatide, 
glucagon and thioredoxin derivatives, are reduced to stable trialkylsul- 
fonium species with excellent conversions (17b-e, see Supplementary 
Information). Notably, reduction of a thioredoxin derivative?! to its 
trialkylsulfonium-protein congener 17e proceeds in high conversion 
without affecting its labile disulfide linkage, which serves to highlight 
the mild nature of this protocol. Additionally, the methionine 
bioconjugation and photoreduction steps can be carried out in a one-pot 
operation, which considerably simplifies the overall process without 
compromising the yield or purity of the trialkylsulfonium product. 

In testing the viability of the proposed C-benzylation using a 
modified Hantzsch ester derivative?®, we found that treatment of the 
exenatide-sulfonium conjugate 4b with o-tolyl Hantzsch ester deriv- 
ative 14a, under slightly modified photocatalytic conditions, led to 
cross-coupling and the formation of the C-ligation product 18a in 
high conversion. This unique bioorthogonal protein functionalization 
reaction not only represents a synthetic radical—radical cross-coupling 
using a polypeptide scaffold, but also provides a platform for bioortho- 
gonal protein diversification wherein two distinct functionalities could 
be introduced sequentially at the same amino acid residue. 

In summary, through the merger of methionine-selective 
bioconjugation and a new visible-light-mediated photocatalytic 
reaction platform, information-rich synthetic constructs can be rapidly 
assembled by a two-step protocol directly from native proteins. The 
reactivity inherent to the methionine conjugate distinguishes the 
bioconjugation process from other methods. Moreover, the capacity 
for functional diversification, by tailoring the hypervalent iodine 
reagent and modified Hantzsch-ester derivative, means that highly 
functional protein conjugates could be made readily available directly 
from native proteins. 


Data availability 

The data that support the findings of this study are available within the paper and 
its Supplementary Information. Raw data are available from the corresponding 
author on reasonable request. 
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CO, storage and release in the deep Southern Ocean 
on millennial to centennial timescales 


J. W.B. Rae!*, A. Burke!, L. F. Robinson’, J. FE Adkins’, T. Chen”, C. Cole!, R. Greenop, T. Li*, E. F. M. Littley!, D. C. Nita!», 


J. A. Stewart)? & B. J. Taylor! 


The cause of changes in atmospheric carbon dioxide (CO,) during 
the recent ice ages is yet to be fully explained. Most mechanisms for 
glacial-interglacial CO, change have centred on carbon exchange 
with the deep ocean, owing to its large size and relatively rapid 
exchange with the atmosphere’. The Southern Ocean is thought 
to have a key role in this exchange, as much of the deep ocean is 
ventilated to the atmosphere in this region”. However, it is difficult 
to reconstruct changes in deep Southern Ocean carbon storage, so 
few direct tests of this hypothesis have been carried out. Here we 
present deep-sea coral boron isotope data that track the pH—and 
thus the CO, chemistry—of the deep Southern Ocean over the past 
forty thousand years. At sites closest to the Antarctic continental 
margin, and most influenced by the deep southern waters that 
form the ocean's lower overturning cell, we find a close relationship 
between ocean pH and atmospheric CO;: during intervals of low 
CO), ocean pH is low, reflecting enhanced ocean carbon storage; 
and during intervals of rising CO2, ocean pH rises, reflecting loss 
of carbon from the ocean to the atmosphere. Correspondingly, 
at shallower sites we find rapid (millennial- to centennial-scale) 
decreases in pH during abrupt increases in COz, reflecting the 
rapid transfer of carbon from the deep ocean to the upper ocean 
and atmosphere. Our findings confirm the importance of the deep 
Southern Ocean in ice-age CO, change, and show that deep-ocean 
CO, release can occur as a dynamic feedback to rapid climate change 
on centennial timescales. 


The Southern Ocean may act as a net source of CO) from the deep 
ocean to the atmosphere or as a net sink*, depending on the balance 
between regional CO) supply via circulation and CO, removal via bio- 
logical productivity. Various records have shown that large changes in 
circulation*® and biological productivity® occurred in the Southern 
Ocean on glacial timescales, with the potential to change the partition- 
ing of carbon between the deep ocean and the atmosphere. However, 
reconstructions of deep ocean CO; chemistry are currently sparse and 
harder to interpret simply in terms of carbon storage. For instance, 
records of CO, chemistry from the deep Atlantic’ and deep Pacific® 
show decreases in carbonate ion saturation and pH during millennial- 
scale intervals of atmospheric CO) rise; in the absence of other 
processes, low carbonate ion concentration and pH imply an increase 
in CO; storage in the deep ocean, so these signals are thought instead 
to be dominated by changes in circulation and deep water masses. On 
longer timescales, records from the deep Indo-Pacific? appear to reflect 
changes in CO; storage, but are damped by the buffering influence of 
carbonate compensation. Records of deep ocean CO; chemistry that 
clearly demonstrate CO, storage during falls in atmospheric CO2, and 
CO), release during increases in atmospheric COs, have proved elusive. 

Here we test the hypothesis that carbon storage in the deep Southern 
Ocean played an important role in ice-age CO change, with new boron 
isotope (5'!B) data from uranium-thorium-dated deep-sea corals from 
the Drake Passage’ (Fig. 1; see Methods). The boron isotope pH proxy 
(see Methods) provides a sensitive measure of the ocean carbonate 


pH and neutral density contours 


50°S 


55° S 


Depth (m) 
S 
fo} 
oO 


60°S 


65° S|! 


fil 3,000 
so°Ww 70°W 60°W 50° W 0 


Fig. 1 | Locations of deep-sea coral samples. The cross-section (right) 
was constructed from hydrographic stations across the Drake Passage 
(dark blue dots on map, left). Steeply dipping isopycnals in this region 
(white contours) mean that our sites fall into two groups that span distinct 
volumes of the deep ocean’'. The green triangle marks the lower cell sites, 
which lie close to the Antarctic continental margin in the Shackleton 
Fracture Zone; the blue diamonds mark the upper cell sites, which lie 

at lower densities on the Sars (lighter blue) and Interim (darker blue) 
seamounts. Lower cell waters are rich in DIC (shading, right) with low pH 
(black contours); upper cell waters have higher pH and are more closely 


BT 8.0: oT 


ie} 


ie} 


oa 
oO oO 
(,-6y jown) o1d 


400 600 800 


Section distance (km) 


1,000 


connected to the atmosphere. Coral locations on the section are given 

in coordinates of depth and neutral density, based on water column data 
collected alongside the coral dredges. Note that there is no significant 
offset in our !'B data between upper cell corals from different depths 
(Extended Data Fig. 2), and that differences in pH between these sites are 
small compared to the range seen in our records. The 27.9 neutral density 
contour, which today lies at the boundary between the lower and upper 
cells'!, is shown in the map view (left) at 820 m, the average depth of our 
corals. Section data are from GLODAPv2***», plotted using isopycnal 
gridding in Ocean Data View. 
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Fig. 2 | Deep Southern Ocean CO) chemistry, atmospheric CO, and 

Antarctic climate records over the past 40,000 years. Green triangles 

and blue diamonds show lower and upper cell deep-sea coral §!'B data, 

respectively. Individual subsamples are shown as small open symbols and 

mean values as larger filled symbols. Error bars on individual subsamples 

are equivalent to 2 s.d. analytical reproducibility and error bars on 

mean coral values represent 2 s.e. uncertainty on the mean of replicate 

subsamples (see Methods). Synchronized ice core CO, data*® are shown 

in grey symbols: circles from Dome C, dots from WAIS, stars from Taylor 

Dome, triangles from TALDICE, pluses from EDML, diamonds from Byrd, 

and squares from Siple Dome. West Atlantic ice sheet (WAIS) 6'8O (orange 

line), which reflects Antarctic temperature, and sea salt sodium (ssNa, blue 

line), a proxy for sea ice, have been smoothed with a running mean”*. Grey 

bands highlight intervals of CO) rise. 


system, closely tracking CO2 concentrations and reflecting the ratio of 
the two master variables, dissolved inorganic carbon (DIC) and alka- 
linity. Although full reconstruction of the carbonate system requires 
knowledge of a second parameter, it is unlikely that alkalinity was lower 
in the glacial ocean”®, or varied as dynamically as DIC, so our §!'B pH 
record can be largely attributed to changes in carbon storage. Note that 
as our 6!'B record extends beyond the pH calibration possible in mod- 
ern Desmophyllum dianthus (see Methods), we focus our discussion on 
relative changes in pH as traced by coral 6''B, and provide absolute pH 
estimates in Extended Data Fig. 1 for reference. Our sample sites reflect 
distinct volumes of the deep ocean!!: the ‘lower cell’ sites lie close to the 
Antarctic continental margin, bathed by waters that plumb the middle 
to lower depths of the deep ocean; the ‘upper cell’ sites lie on lighter 
isopycnal surfaces, bathed by waters found at shallower depths in the 


570 | NATURE | VOL 562 | 25 OCTOBER 2018 


ocean basins (Fig. 1 and Extended Data Fig. 2). Note that there is no 
significant offset in our 6''B data between upper cell corals from differ- 
ent depths (Extended Data Fig. 2), and that differences in pH between 
these sites are small (Fig. 1) compared to the range seen in our records. 

During the last glacial maximum (LGM) and early deglaciation, 
we see a clear gradient between the lower cell sites, which show low 
6!!B and pH, and the upper cell sites, with relatively high &''B and pH 
(Fig. 2). This supports the idea that during glacial intervals the deep 
ocean—and its carbon—was more stratified into two cells with limited 
interactions'!. Our data show that the lower cell was rich in carbon 
compared to the upper cell and compared to modern values expected 
at this site (Fig. 1, Extended Data Fig. 1), providing strong support for 
the hypothesis that the deep glacial ocean sequestered carbon from the 
upper ocean and the atmosphere’. 

During the deglaciation, this gradient in deep carbon breaks down, 
with lower cell pH rising in step with atmospheric COz, and pH in the 
upper cell falling towards lower cell values (Fig. 3). This provides direct 
evidence for the transfer of carbon from the deep ocean to the upper 
ocean and the atmosphere. Carbon transfer to the upper ocean appears 
to be particularly pronounced at approximately 14.7 and 11.7 thou- 
sand years ago (kyr ago), coincident with the centennial-scale jumps 
in atmospheric CO,” associated with abrupt warming in the Northern 
Hemisphere. This provides the first evidence, to our knowledge, ofa fast 
teleconnection between abrupt changes in the North Atlantic and the 
carbon chemistry of the deep Southern Ocean. Lower cell D. dianthus 
samples have not been found in the Holocene (see Methods), but the 
available data at the end of the deglaciation and in the modern water 
column suggest much weaker pH gradients, consistent with less- 
pronounced property gradients in the modern deep ocean compared 
to the glacial deep ocean! »!>!4, 

Our data show that the carbonate chemistry of the deep Southern 
Ocean was closely linked to atmospheric CO, changes over the past 
40,000 years. These data thus provide a crucial missing piece of the 
glacial CO2 puzzle: the most direct evidence to date of deep Southern 
Ocean carbon storage and release, as previously inferred from phys- 
ical properties”), carbon isotopes}, and oxygen content!® (Fig. 2, 
Extended Data Fig. 3). While other processes* and regions® may have 
contributed to the full magnitude of glacial-interglacial CO2 change, 
our data demonstrate a key role for the Southern Ocean on millennial 
to centennial timescales. 

Several processes might have contributed to the changes in CO, 
storage observed in our record, including changes in ventilation*’®, 
biological pump efficiency®, and sea ice!” (Extended Data Fig. 3). We 
note a close correspondence between lower cell pH and ice core sea 
salt sodium, a proxy that may reflect changes in sea ice production!*” 
(Fig. 2), which suggests that sea ice may have played an important role 
in CO, change. Sea ice has the potential to influence CO) storage both 
through its influence as a ‘lid’ on surface-ocean outgassing”, and its 
effect on deep circulation!!. Expansion of sea ice at the LGM?! along- 
side an increase in surface ocean density in the Southern Ocean rela- 
tive to the North Atlantic”°, would have helped to create an expanded 
lower cell with salty!3, CO,-rich water. This could have helped to shoal 
the upper-lower cell boundary above the zone of enhanced mixing 
over rough bottom topography’, trapping salt and CO; in the abyss. 
Accumulation of CO, at depth would have been further promoted by 
an enhanced biological pump due to iron fertilization® and increased 
upper ocean stratification, which would also have reduced CO; escape 
through leads and under ice-free conditions. 

This framework may also explain the release of CO, from the deep 
Southern Ocean on millennial timescales”!. Increases in CO; typi- 
cally occur during intervals of cold stadial conditions in the Northern 
Hemisphere and warming in the south (the bipolar seesaw)”. This 
southern warming is associated with a decrease in southern sea ice 
(Figs. 2, 3) and a decrease in the surface density gradient between the 
Southern Ocean and the North Atlantic, shifting the boundary between 
the overturning cells to greater depth in the basins!!°, CO,-rich water 
that was previously isolated in the abyss could thereby have been mixed 
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Fig. 3 | Deglacial records of deep Southern Ocean CO) chemistry, 
atmospheric CO, and climate over Antarctica and Greenland. Green 
triangles and blue diamonds show lower and upper cell deep-sea coral 
5!'B data, respectively. Individual subsamples are shown as small open 
symbols and mean values as larger filled symbols. Error bars on individual 
subsamples are equivalent to 2 s.d. analytical reproducibility and error bars 
on mean coral values represent 2 s.e. uncertainty on the mean of replicate 
subsamples (see Methods). Greenland ice core 6'8O (dark blue line), WAIS 
5'80 (orange line), and sea salt sodium (blue line) have been smoothed 
with a running mean”*. CO, data (grey symbols) are from WAIS” with a 
five-point running mean (grey line). Light grey vertical bands highlight 
intervals of millennial-scale CO} rise; dark grey vertical bands highlight 
intervals of centennial-scale CO; rise associated with North Hemisphere 
warming; and the vertical dashed line indicates the rapid CO) rise event 
that occurred at 16.3 kyr ago, within Heinrich Stadial 1. 


into the upper cell over rough topography in the ocean basins, and/or 
transferred into the upper cell upon upwelling north of the sea ice 
edge, perhaps aided by the westerly winds”** or increased mixed layer 
depths in the Southern Ocean. CO, loss from the deep ocean might also 
have been aided by reduced biological pump efficiency®. Whatever the 
exact mechanism, this carbon transfer is recorded by a pH increase in 
our lower cell corals and a pH decrease in our upper cell corals, as CO2 
was transferred to the upper ocean and the atmosphere. Note that as a 
southward shift in the fronts at these times might be expected to expose 
our sites to higher-pH water (Fig. 1), the transfer of low-pH DIC-rich 
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water into the upper cell might have been even larger than observed. 
Indeed, a southward frontal shift and breakdown in stratification are 
likely to explain the younging seen in upper cell radiocarbon*”® at this 
time (Extended Data Fig. 4). Upwelling of carbon and nutrient-rich 
water during cold Northern Hemisphere stadials is also supported 
by low pH in surface waters, as recorded by boron isotopes in plank- 
tic foraminifera and enhanced opal fluxes”? (Fig. 3, Extended Data 
Fig. 4). If salt from the high-salinity lower cell!? was also transferred 
back into the upper cell, this might have aided the re-initiation of North 
Atlantic deep water (NADW) formation?’. Once interstadial conditions 
were reestablished in the North Atlantic, the Southern Ocean would 
have started to cool via the bipolar seesaw and sea ice expanded”*. This 
would shoal the cell boundary, reduce ocean-atmosphere exchange, 
and allow CO; and salt to again become trapped in the abyss (Fig. 4). 

The centennial-scale increases in CO) at 14.7 and 11.7 kyr ago”, 
which are associated with pronounced minima in upper cell pH 
(Fig. 3), appear to require a more efficient mode of Southern Ocean 
CO} release associated with abrupt Northern Hemisphere warming”. 
High-resolution records of Antarctic deuterium excess indicate that 
there was a northward shift in the southern westerly winds synchronous 
with abrupt Northern Hemisphere warming”; by contrast, the bipolar 
seesaw cooling response in the south is lagged, with Antarctic §'8O and 
sea salt sodium taking around 200 years to show the onset of cooling 
conditions and increased sea ice production”® (Fig. 3). This might have 
led to a transient condition in the Southern Ocean where CO, could 
be efficiently mixed up from the lower cell”, owing to the fast shift in 
the winds, and then outgassing unimpeded by sea ice, which had not 
yet expanded'’. The northward shift in the fronts might also have con- 
tributed to the pH minima by exposing our upper cell sites to lower-pH 
water from the south (Fig. 1), but such a shift would have to have been 
large given that upper cell pH appears to have reached values similar to, 
or even lower than, the lower cell at this time, suggesting that increased 
input of CO>-rich water was required. Increased input of previously 
isolated carbon-rich waters is also seen in radiocarbon data*”°, which 
show an interruption of their deglacial younging and a slight increase 
in age during these events (Extended Data Fig. 4). Whatever the exact 
mechanisms involved, our data demonstrate that the Southern Ocean 
can fill with CO>-rich waters on centennial timescales and can therefore 
give out its carbon rapidly, countering suggestions that centennial-scale 
increases in CO, are too quick for a deep ocean driver and require exog- 
enous carbon addition (such as from methane hydrates or the terrestrial 
biosphere*”). Our data show that rapid changes in the Southern Ocean 
acted in concert with resumption of vigorous Atlantic meridional over- 
turning circulation (AMOC)”> to drive rapid increases in CO. 

Although our lower cell 6!'B pH data generally show a close coupling 
with atmospheric COz, this relationship was muted during peak gla- 
cial conditions, with pronounced minima in pH at approximately 26 
and 20 kyr ago, alongside extensive sea ice”®, low upwelling”’, and an 
efficient biological pump (Extended Data Fig. 3), but minimal change 
in atmospheric CO2. This supports the idea that there is a lower limit 
on atmospheric CO; at about 190 p.p.m.*}; although Southern Ocean 
carbon storage continues to increase, its influence on the atmosphere 
appears to be offset by other processes, perhaps the onset of CO2- 
limitation of primary productivity on land?’. 

Overall, our data provide a clear demonstration that storage and 
release of CO> in the deep Southern Ocean played a central role in 
glacial—interglacial changes in atmospheric CO . These changes in 
ocean CO; storage are likely to have been driven by a combination 
of changes in ocean circulation, biological pump efficiency, and sea 
ice cover. We note a close correspondence between CO, storage and 
ice core sea ice sodium records, which may suggest that Southern 
Ocean sea ice played a key role in glacial CO2 change, owing to its 
joint influence on deep overturning and surface outgassing. This pro- 
vides a mechanistic explanation for the tight link between Antarctic 
temperature and CO) change on glacial—interglacial timescales, 
although several processes acting together are likely to be required 
to explain the full magnitude of glacial CO, change. Our data also 
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Fig. 4 | Schematic of changes in sea ice, circulation, and deep ocean 
carbon storage. a, At the LGM, Southern Ocean cooling and extensive 
sea ice helped to create an expanded lower cell with salty, CO2-rich water. 
The cell boundary was shoaled above the zone of enhanced mixing over 
rough bottom topography, isolating salt and CO, in the abyss. b, During 
Northern Hemisphere stadial events, North Atlantic overturning was 
reduced, the Southern Ocean warmed, and sea ice retreated. This shifted 
the cell boundary such that water previously isolated in the lower cell now 
upwells north of the sea ice edge. This water, and its CO and salt, were 
transferred to the upper ocean and CO) outgassed to the atmosphere. 
Transfer of salt from the lower to the upper cell might have helped to 
re-initiate NADW formation. c, At the onset of a Northern Hemisphere 
interstadial event (for example, Bolling—Allerod, end Younger Dryas), 


highlight the ability of the Southern Ocean—and its CO,—to respond 
to millennial and centennial-scale shifts in climate linked to the North 
Atlantic’s overturning circulation. Indeed, it is possible that the frame- 
work presented here, which links the storage and release of deep ocean 
CO, and salt to changes in Southern Ocean sea ice and the bipolar 
seesaw, may help to account for the occurrence of millennial-scale 
changes in CO) and climate during mid-glacial conditions. At these 
times, moderate sea ice extent means that the boundary between the 
ocean's lower and upper cells was located near the top of rough sea- 
floor topography. Small shifts in the cell boundary, linked to changes 
in the Southern sea ice edge, could therefore have driven large shifts in 
the degree of mixing between the lower and upper cells, and their salt 
and COp. This framework may also give behaviour similar to ‘density 
oscillator’ models for rapid climate change?”**?: when the AMOC is 
active and the Northern Hemisphere warms, the south cools and sea 
ice expands, progressively isolating cold salty water (and CO ) in the 
lower cell. This may make the AMOC vulnerable to collapse, at which 
point the south warms, sea ice retreats, and the cell boundary deepens. 
This helps to mix salt (and CO;) back into the upper cell and makes 
the lower cell fresher and warmer, reducing the deep ocean density 
contrast, and helping to poise the system for AMOC resumption. 
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METHODS 

Sample collection and chronology. Deep-sea coral samples were collected by 
dredge during two cruises in the Drake Passage (NBP0805 and NBP1103). Sample 
locations are given in Extended Data Tables 1, 2, and sample depths are also shown 
in Extended Data Fig. 2. A total of 392 D. dianthus samples were initially ‘recon- 
naissance’ dated, either by 1403738 or laser ablation U-Th2””, to obtain preliminary 
ages. Suitable samples with ages within the past ~50 kyr were then precisely dated 
using isotope dilution U-Th by multicollector-inductively coupled plasma mass 
spectrometry (MC-ICPMS)*”°. All ages have been published previously, and have 
typical uncertainties of around + 1% (2 s.d.), although this varies between samples 
depending on their initial 7°°Th (calculated from measured 7**Th and assuming 
an initial atomic ?°°Th/?*’Th ratio of 12,500 + 12,500). Age errors are plotted in all 
time series figures but are typically smaller than the symbols. This precisely dated 
and unbioturbated deep sea archive provides a unique record of ocean pH change 
at resolution comparable to those of the ice cores. 

Sample preparation. Coral pieces were sampled from the growing tip of 
coral septa. Samples were physically cleaned using a Dremmel tool to remove 
all visible iron-manganese oxides and any chalky white carbonate, indicative of 
alteration. 

We tested the potential influence of microstructural variability on coral 6"'B 

with multiple solid sub-samples from the same coral (Extended Data Fig. 6). 
Coral centres of calcification have previously been observed to have anomalously 
light boron isotope values, along with high Mg/Ca and low U/Ca*?, and we 
observed this same coupled variability in our coral subsamples (Extended Data 
Fig. 6). However this microstructural signal is typically small compared to the 
size of the boron isotope signals seen in our record. We had expected the smaller 
subsamples to exhibit more scatter, owing to the potential influence of coral centres 
of calcification (COCs), but this is not shown in these data. There is in fact slightly 
more variability between larger chunks, perhaps owing to the increased chance of 
sampling some COC material. For our records we used coral pieces of ~1 mg. We 
also mitigated microstructural influences by taking multiple solid subsamples from 
each coral when possible (shown as open symbols). Two subsamples (at 1.6 kyr 
ago and 20.4 kyr ago) were rejected from our total set of 55 as having anomalous 
8!B values (~1%o lighter than the mean of 3-4 other subsamples from that coral), 
possibly owing to the impact of COC material. 
Boron isotope analysis. Solid coral samples were crushed to a grain size <1 mm 
using an agate pestle and mortar. Samples were then subjected to oxidative cleaning 
to remove organic matter following established protocols**“’, using warm 1% 
hydrogen peroxide, buffered in 0.1 M NH4OH, followed by leaching in 0.0001 M 
HNO, and dissolution in 0.075 M distilled HNO3. Boron was purified from the 
sample matrix using column chromatography with the boron-specific ion exchange 
resin Amberlite 743°". 

Boron isotope composition was analysed by MC-ICPMS by sample-standard 
bracketing*”**°!. Early analyses used NH; to improve boron washout in the spray 
chamber™, which reduces background signals to ~3% of the preceding sample 
within ~3 min*. More recent analyses used dilute HF°?, which reduces background 
signals to ~0.5% within ~3 min. In contrast to some previous work, where samples 
were analysed in pure 0.3 M HF®>, we add a small volume of concentrated HF to 
our sample following column elution in 0.5 M HNO:, giving a solution of 0.5 M 
HNO; + 0.3 M HE Similarly, bracketing standards (NIST 951) and instrument 
blank acid were analysed in 0.5 M HNO; + 0.3 M HF to ensure consistent mass 
bias and blank corrections, though beyond improved washout we did not find any 
significant influence of HF concentration: boric acid standards run as dummy 
samples with HF concentrations from 0 to 0.5 M HF bracketed against 951 in 
0.5 M HNO; + 0.3 M HF all yielded identical boron isotope ratios. Carbonate 
standards (JCP, NIST RM8301C) passed through columns and run with NH; or 
HF also yielded identical values. 

All preparation and analytical work was carried out in boron-free clean 
laboratory conditions. Over the course of this work samples were analysed at the 
University of Bristol and Caltech on a Neptune MC-ICPMS, and the University of 
St Andrews on a Neptune Plus MC-ICPMS, though in all cases following nearly 
identical protocols. Each of these laboratories has taken part in published and 
ongoing inter-laboratory comparison studies* and there is no analytical offset 
between samples run in these laboratories. 

Long-term analytical reproducibility on 6!'B measurements in these laboratories 
(assessed with carbonate standards given the same treatment as samples) is around 
0.23 %o (2 s.d.) on samples of the size used here (~20 ng boron)—this is the error 
bar given on individual subsamples (open symbols in figures). We use a more 
conservative uncertainty for our mean coral 6!!B values (filled symbols in figures), 
to account for the 6"'B variability between subsamples from the same coral. This 
is based on the pooled s.d. of the replicate samples in our records and in Extended 
Data Fig. 6 (2 s.d.=0.51 %o). Uncertainty is reduced on mean values with multiple 
replicates (n), which is accounted for using the standard error (s.d./ Vn). Data from 
individual coral subsamples (open symbols in figures) are given in Extended Data 


Table 1 and average values from a given coral specimen (closed symbols and lines 
in figures) are given in Extended Data Table 2. 

51!B and pH in deep-sea corals. The boron isotope pH proxy provides a sensitive 
measure of the ocean carbonate system, though in common with many proxies, 
is also influenced by modification during biomineralization®*~*’. In particular, 
coral §!!B is influenced by internal pH elevation during biomineralization®*”, 
which may buffer its sensitivity to external seawater pH changes in some settings. 
To examine this we have compiled modern D. dianthus 6"'B calibration data***° 
from water depths >100 m, and have added two recent Southern Ocean samples 
(Extended Data Table 3). This indicates that the relationship between seawater pH 
and coral §"'B is curved, with 6!!B becoming more sensitive to external seawater 
at lower pH (Extended Data Fig. 5). This suggests that corals find it harder to ele- 
vate internal pH as external conditions become more acidic, which is reasonable 
given the increase in energy demand required for additional proton expulsion“). 
It also means that corals are likely to be more sensitive to external pH conditions 
at lower pH sites, such as the Southern Ocean locations in this study. Hence, the 
pH changes that occurred during the last 40,000 years are evident as large, easily 
resolvable changes in "'B in our coral record. 

Reconstructed pH based on this calibration is shown in Extended Data Fig. 1. 
Given the paucity of modern deep-sea coral samples from low-pH waters, our 
record extends beyond the available calibration range. As a result, conversion to 
absolute pH values carries relatively large uncertainty, which is hard to assess. We 
thus prefer to focus on relative changes in the 6!!B records themselves, which pro- 
vide a proxy of carbonate chemistry in their own right™, analogous to the typical 
use of 80 records in paleoceanography. 


Data availability 


The data produced in this study are available in Extended Data Tables and will also 
be made available at the NOAA (https://www.ncdc.noaa.gov/paleo/study/25230) 
and Pangaea data repositories. 
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Extended Data Fig. 1 | Deep Southern Ocean CO; chemistry and calibration in Extended Data Fig. 5, but uncertainty on this calibration is 
atmospheric CO, over the last 40,000 years. Green triangles and blue large (inset error bar), given the paucity of modern deep-sea coral data 
diamonds show lower and upper cell deep-sea coral §!'B data, respectively. from low pH waters. Instead we focus on the 6''B values themselves, which 
Individual subsamples are shown as small open symbols and mean values provide a proxy of carbonate chemistry in their own right®. Synchronized 
as larger filled symbols. Error bars on individual subsamples are equivalent ice core CO, data*® are shown in grey symbols: circles from Dome C, dots 
to 2 s.d. analytical reproducibility and error bars on mean coral values from WAIS, stars from Taylor Dome, triangles from TALDICE, pluses 
represent 2 s.e. uncertainty on the mean of replicate subsamples (see from EDML, diamonds from Byrd, and squares from Siple Dome. Grey 


Methods). Approximate pH values are given based on coral 6''B using the bands highlight intervals of CO; rise. 
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Extended Data Fig. 2 | Deep Southern Ocean CO; chemistry and 


simultaneous with a change in depth is the decrease at ~11.5 kyr ago, but 
atmospheric CO, over the last 40,000 years, highlighting the depths the jump back up to higher 6''B values following this event occurs without 
of upper cell corals. Symbols and data are as plotted in Extended Data a change in depth, giving confidence that the excursion is not a depth- 
Fig. 1, but with the addition of the lower panel and annotations showing related signal. Furthermore the large excursion at ~14.7 kyr ago occurs 
the depth in metres of each upper cell coral sample. No systematic offsetis without a significant change in depth. Note that all of the lower cell corals 
seen between samples from different depths. The only signal that occurs 


come from within 17 m water depth of each other. 
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Extended Data Fig. 3 | Records of Southern Ocean biogeochemistry (crosses). The opal flux records from each core are shown on separate 


and CO), over the last 40,000 years. Data are plotted as in Fig. 2, but with scales. The coral §!°N data are grouped into samples from the Antarctic 
opal flux’, a proxy for upwelling, deep sea coral §!°N®, a proxy for surface zone (AZ, blue) and Subantarctic zone (SAZ, red); smoothed fits to the 


ocean nitrate consumption, and authigenic uranium concentrations!®, data are shown, as provided in the original study”. Intervals of low CO 
a proxy for bottom water redox. The opal flux and authigenic uranium during the last ice age are associated with low upwelling, an efficient 
records combine two sediment cores: TN057-13-4PC in the younger part biological pump, low oxygen water rich in respired carbon, and low-pH 
of the record (pluses) and TN057-14PC in the older part of the record carbon-rich water in the deep Southern Ocean. 
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surface ocean-atmosphere CO, difference, based on 6!'B in planktic of upwelling of '*C-depleted waters and improved ventilation over the 
foraminifera”®, and radiocarbon data*”* from corals within these sample deglaciation. 


groupings, shown as '4C age offsets compared to the contemporaneous 
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Extended Data Fig. 5 | Boron isotope calibration for modern relationship between 6"!B of borate and pH is also somewhat influenced 
D. dianthus. Data are from open ocean sites*+°®, with two additional by water temperature, salinity, and depth™. A power law function 
recent (<1,650 years ago) samples from the Southern Ocean from this was fitted to the data using Matlab’s curve fitting toolbox (solid line: 
study. Water column 6!'B of borate (B(OH),4_) values are as previously 8" Bcoral = —1.82!* x 6" Baonya— |??? + 27.03; R? = 0.57). Dashed lines 
published or are calculated from carbonate chemistry data from nearby show the 95% confidence intervals and give a measure of calibration 
GLODAPV? sites for the new samples, as described*”**, Note that the uncertainty as shown in the error bar in Extended Data Fig. 1, although 
sensitivity of §''B in carbonates to pH is based on the pH sensitivity data from a given site may be able to record relative changes in pH more 
of &''B of borate. pH itself is not easily shown on a plot like this, as the sensitively, as seen in many paleo-proxies. 
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Extended Data Table 1 | 5!“B data for all fossil D. dianthus coral samples and subsamples (open symbols in Figures) 


Cruise Dredge Sample Latitude Longitude Depth Group Location Age Ageerror §"B  8''Berror 
(°N) (°E) (m) (yrBP) (yr2SD) (%o) (%o 2SD) 
NBP0805 DR23 Dc-A-7 -60.182 -57.828 819 LowerCell SFZ 11850 62 26.05 0.23 
NBP0805 DR23 Dc-A-7 -60.182 -57.828 819 LowerCell SFZ 11850 62 25.99 0.23 
NBP0805 DR23 Dc-A-5 -60.182 -57.828 819 LowerCell SFZ 13127 77 24.53 0.23 
NBP0805 DR23 Dc-A-5 -60.182 -57.828 819 LowerCell SFZ 13127 77 25.07 0.23 
NBP0805 DR23 Dc-A-6 -60.182 -57.828 819 LowerCell SFZ 15629 49 24.15 0.23 
NBP0805 DR23 Dc-A-6 -60.182 -57.828 819 LowerCell SFZ 15629 49 25.04 0.23 
NBP1103 DH40 Dc-3 -60.179 -57.837 806 LowerCell SFZ 18047 168 22.99 0.23 
NBP1103 DH40 Dc-3 -60.179 -57.837 806 LowerCell SFZ 18047 168 23.18 0.23 
NBP1103 DH43 Dc-6 -60.179 -57.003 823 LowerCell SFZ 19631 189 23.12 0.23 
NBP1103 DH43 Dc-1 -60.179 -57.003 823 LowerCell SFZ 20367 58 23.87 0.23 
NBP1103 DH43 Dc-1 -60.179 -57.003 823 LowerCell SFZ 20367 58 23.90 0.23 
NBP1103 DH40 De-5 -60.179 -57.837 806 LowerCell SFZ 22578 81 23.68 0.23 
NBP1103 DH40 Dc-5 -60.179 -57.837 806 LowerCell SFZ 22578 81 23.53 0.23 
NBP0805 DR23 Dc-A-1 -60.182 -57.828 819 LowerCell SFZ 25587 160 23.03 0.23 
NBP1103  DH43 Dc-3 -60.179 -57.003 823 LowerCell SFZ 27159 122 22.46 0.23 
NBP1103 DH43 Dc-3 -60.179 -57.003 823 LowerCell SFZ 27159 122 22.86 0.23 
NBP1103 DH43 Dc-12 -60.179 -57.003 823 LowerCell SFZ 28679 329 24.01 0.23 
NBP1103 DH43 Dc-12 -60.179 -57.003 823 LowerCell SFZ 28679 329 24.13 0.23 
NBP1103 DH43 Dc-12 -60.179 -57.003 823 LowerCell SFZ 28679 329 23.69 0.23 
NBP0805 DR23 Dc-A-2 -60.182 -57.828 819 LowerCell SFZ 35406 306 23.81 0.23 
NBP0805 DR23 Dc-A-3 -60.182 -57.828 819 LowerCell SFZ 39193 301 25.15 0.23 
NBP0805 DR23 Dc-A-3 -60.182 -57.828 819 LowerCell SFZ 39193 301 24.76 0.23 
NBP0805 DR23 Dc-A-3 -60.182 -57.828 819 LowerCell SFZ 39193 301 24.64 0.23 
NBP0805 DR35 Dc-A-1 -59.721 -68.883 695 UpperCell Sars 1649 13 26.16 0.23 
NBP0805 DR35 Dc-A-1 -59.721 -68.883 695 UpperCell Sars 1649 13 25.85 0.23 
NBP0805 DR34 Dc-A-2 -59.732 -68.746 869 UpperCell Sars 5759 30 26.30 0.23 
NBP0805 DR34 Dc-A-2 -59.732 -68.746 869 UpperCell Sars 5759 30 26.02 0.23 
NBP0805 DR34 Dc-A-1 -59.732 -68.746 869 UpperCell Sars 6798 34 26.54 0.23 
NBP0805 DR34 Dc-A-1 -59.732 -68.746 869 UpperCell Sars 6798 34 25.55 0.23 
NBP0805 DR35 Dc-C-2 -59.721 -68.883 695 UpperCell Sars 7639 28 25.59 0.23 
NBP0805 DR35 Dc-D-4 -59.721 -68.883 695 UpperCell Sars 10084 34 25.36 0.23 
NBP0805 DR35 Dc-D-4 -59.721 -68.883 695 UpperCell Sars 10084 34 26.22 0.23 
NBP0805 DR35 Dc-D5 -59.721 -68.883 695 UpperCell Sars 10255 81 26.03 0.23 
NBP0805 DR35 Dc-D-3 -59.721 -68.883 695 UpperCell Sars 11084 34 24.94 0.23 
NBP0805 DR35 Dc-B-1a -59.721 -68.883 695 UpperCell Sars 11317 36 24.87 0.23 
NBP0805 DR35 Dc-B-1a -59.721 -68.883 695 UpperCell Sars 11317 36 24.93 0.23 
NBP0805 DR38 Dc-A-1 -59.741 -68.900 978 UpperCell Sars 11549 64 26.74 0.23 
NBP1103. DH117 = Dn-1 -59.764 -68.936 981 UpperCell Sars 12680 102 26.82 0.23 
NBP1103  DH117 = Dn-7 -59.764 -68.936 981 UpperCell Sars 14143 80 26.90 0.23 
NBP1103. DH117 = Dn-7 -59.764 -68.936 981 UpperCell Sars 14143 80 26.35 0.23 
NBP1103. DH117 = Dn-7 -59.764 -68.936 981 UpperCell Sars 14143 80 26.23 0.23 
NBP1103. DH117 = Dc-20 -59.764 -68.936 981 UpperCell Sars 14177 122 25.71 0.23 
NBP1103  DH117 = Dc-1 -59.764 -68.936 981 UpperCell Sars 14475 111 26.34 0.23 
NBP1103. DH117 = Dc-29 -59.764 -68.936 981 UpperCell Sars 14609 74 25.83 0.23 
NBP1103. DH117 ~=Dc-29 -59.764 -68.936 981 UpperCell Sars 14609 74 25.75 0.23 
NBP1103  DH74 Dc-3 -60.606 -66.004 1064 UpperCell Interim 14714 122 23.80 0.23 
NBP1103 DH74 Dce-3 -60.606 -66.004 1064 UpperCell Interim 14714 122 24.90 0.23 
NBP1103. DH117 = Dc-36 -59.764 -68.936 981 UpperCell Sars 14889 80 24.90 0.23 
NBP1103. DH117 = Dc-36 -59.764 -68.936 981 UpperCell Sars 14889 80 25.74 0.23 
NBP1103 DH75 De(f)-37 -60.613 -66.002 1196 UpperCell Interim 15221 72 25.54 0.23 
NBP1103. DH117 Dc-09 -59.764 -68.936 981 UpperCell Sars 16851 168 26.14 0.23 
NBP0805 DR27 Dc-A-1 -60.546 -65.953 1134 UpperCell Interim 21466 109 24.90 0.23 
NBP0805 DR27 Dc-A-1 -60.546 -65.953 1134 UpperCell Interim 21466 109 24.71 0.23 


This table is also available as a spreadsheet in the Supplementary Information and on Pangaea and NCDC. Location groupings are: SFZ, Shackleton fracture zone; Sars, Sars Seamount; Interim, 
Interim Seamount (see map in Fig. 1). 
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Extended Data Table 2 | Averaged 5!“B data from each D. dianthus coral specimen (filled symbols in figures) 


Cruise Dredge Sample Latitude Longitude Depth Group Location Age Ageerror §"'B n §"'B error 
(N) (°E) (m) (yrBP) (yr2SD) (%o) (%0 2SD) 
NBP0805 DR23_ Dc-A-7 ~60.182 “57.828 819 LowerCell SFZ 11850 62 26.02. +2 0.36 
NBP0805 DR23. Dc-A-5 -60.182 -57.828 819 — LowerCell SFZ 13127 77 24.80 2 0.36 
NBP0805 DR23  Dc-A-6 -60.182 -57.828 819 — LowerCell SFZ 15629 49 24.59 2 0.36 
NBP1103. DH40 ——De-3 -60.179 -57.837 806 —_LowerCell SFZ 18047 168 23.08 2 0.36 
NBP1103_ DH43.—_Dc-6 -60.179 -57.003 823 LowerCell SFZ 19631 189 2312 1 0.51 
NBP1103 DH43.—De-1 -60.179 -57.003 823 LowerCell SFZ 20367 58 23.89 2 0.36 
NBP1103. DH40 —De-5 -60.179 -57.837 806 —_ LowerCell SFZ 22578 81 23.61 2 0.36 
NBP0805 DR23_ Dc-A-1 -60.182 -57.828 819  LowerCell SFZ 25587 160 23.031 0.51 
NBP1103_ DH43.—_De-3 -60.179 -57.003 823 LowerCell SFZ 27159 122 22.66 2 0.36 
NBP1103. DH43.—De-12 -60.179 -57.003 823 LowerCell SFZ 28679 329 23.94 3 0.29 
NBP0805 DR23. Dc-A-2 -60.182 -57.828 819 — LowerCell SFZ 35406 306 23.81 1 0.51 
NBP0805 DR23. Dc-A-3 -60.182 -57.828 819 — LowerCel SFZ 39193 301 2485 3 0.29 
NBP0805 DR35 —Dc-A-1 59.721 -68.883 695 — UpperCel Sars 1649 13 26.00 2 0.36 
NBP0805 DR34__——Dc-A-2 -59.732 -68.746 869 — UpperCel Sars 5759 30 26.16 2 0.36 
NBP0805 DR34 ——Dc-A-1 -59.732 -68.746 869 — UpperCel Sars 6798 34 26.04 2 0.36 
NBP0805 DR35 _—De-C-2 59.721 -68.883 695 — UpperCel Sars 7639 28 2559 1 0.51 
NBP0805 DR35 —De-D-4 59.721 -68.883 695 — UpperCel Sars 10084 34 25.79 2 0.36 
NBP0805 DR35 ~——De-D5 59.721 -68.883 695 — UpperCel Sars 10255 81 26.031 0.51 
NBP0805 DR35 —De-D-3 -59.721 -68.883 695 — UpperCel Sars 11084 34 24.94 1 0.51 
NBP0805 DR35 —De-B-1a 59.721 -68.883 695 — UpperCel Sars 11317 36 24.90 2 0.36 
NBP0805 DR38 —_Dc-A-1 -59.741 -68.900 978 — UpperCel Sars 11549 64 26.74 1 0.51 
NBP1103_ DH117_— Dn-1 -59.764 -68.936 981 — UpperCel Sars 12680 102 26.82 1 0.51 
NBP1103  DH117_— Dn-7 -59.764 -68.936 981 — UpperCel Sars 14143 80 2649 «3 0.29 
NBP1103_ DH117 —_Dc-20 -59.764 -68.936 981 UpperCell Sars 14177 122 25.71 1 0.51 
NBP1103. DH117—_De-1 -59.764 -68.936 981 UpperCell Sars 14475 111 26.34 1 0.51 
NBP1103_ DH117 —_Dc-29 -59.764 -68.936 981 UpperCell Sars 14609 74 25.79 2 0.36 
NBP1103_ DH74 —De-3 -60.606 -66.004 1064 UpperCell Interim 14714 122 24.35 «2 0.36 
NBP1103 DH117 —Dc-36 -59.764 -68.936 981 UpperCell Sars 14889 80 25.32 2 0.36 
NBP1103. DH75 _De(f)-37 -60.613 -66.002 1196  UpperCell Interim 15221 72 25.54 1 0.51 
NBP1103 DH117 —_Dc-09 -59.764 -68.936 981 — UpperCell Sars 16851 168 2614 1 0.51 
NBP0805 DR27 —Dc-A-1 -60.546 -65.953 1134 UpperCell Interim 21466 109 24.81 2 0.36 


This table is also available as a spreadsheet in the Supplementary Information and on Pangaea. Location groupings are: SFZ, Shackleton fracture zone; Sars, Sars Seamount; Interim, Interim Sea- 
mount (see map in Fig. 1). Uncertainty on mean coral 6!!B values is based on the pooled s.d. of all the replicates in this study (including those shown in Extended Data Fig. 6), divided by square root of 
n, to give a measure of the s.e. 
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Extended Data Table 3 | D. dianthus 5"“B calibration data (as shown in Extended Data Fig. 5) 


8" Boral 
Study Sample Depth Latitude Longitude T Ss ALK DIC PH — Qaragorite 8 Bporate 8""Beora’ 8'"Beora Error. += Subsamples 
(m) (°N) (°E) (°C) (psu) (umol/kg) (umol/kg) total scale (%0) (%o) (%0 2SD) (%o) 

McCulloch et al., 2012 Tasman Seamount DD_MS 932 40.76 29.17 14.5 38.8 2610 2469 7.77 1.46 15.23 25.78 0.31 25.68, 25.87 
McCulloch et al., 2012 | Marmara Sea Hill_B1 1050 44.33 147.28 4.59 34.4 2315 2217 7.87 1.02 14.98 26.14 0.31 

McCulloch et al., 2012 | MedCor-25-D 576 35.52 14.18 13.78 38.75 2613 2314 8.1 2.88 17.75 27.36 0.31 

McCulloch et al., 2012 | MedCor-74-D 837 36.75 13.99 13.96 38.77 2624 2347 8.05 2.59 17.40 26.8 0.31 

Anagnostou et al., 2012 19249 274 34.00 -119.50 7.8 34.11 2271 2226 7.69 0.83 14.06 25.95 0.25 25.89, 26.08 
Anagnostou et al., 2012 47409 673 54.5 -39.40 1.8 34.69 2336 2238 7.91 1.08 14.89 26.69 0.25 26.63, 26.82 
Anagnostou et al., 2012 47413 421 50.6 167.60 5.9 34.26 2284 2124 8.05 1.63 16.11 26.83 0.25 26.81, 26.89 
Anagnostou et al., 2012 48473 1107 47.70 -8.10 8.5 35.55 2348 2182 7.97 1.5 16.08 26.06 0.25 

Anagnostou et al., 2012 48739 825 47.60 -7.30 9.6 35.53 2344 2172 7.98 1.64 16.15 27.88 0.25 

Anagnostou et al., 2012 48740 1470 48.70 -10.90 5.1 35.12 2324 2169 7.99 1.28 16.03 26.99 0.25 

Anagnostou et al., 2012 62309 522 40.40 -67.70 6.1 35.05 2317 2179 7.97 1.43 15.60 25.92 0.25 25.82, 26.02 
Anagnostou et al., 2012 78630 312 46.80 -130.80 5.8 33.96 2280 2265 7.61 0.65 13.66 23.56 0.25 

Anagnostou et al., 2012 80358 358 48.00 -7.90 1 35.54 2336 2129 8.06 2.11 16.77 25.81 0.25 

Anagnostou et al., 2012 83583 464 32.90 -127.80 5.8 34.1 2290 2284 7.57 0.59 13.56 24.98 0.25 24.95, 25.04 
Anagnostou et al., 2012 94069 710 -30.5 -178.70 74 34.49 2284 2146 7.95 1.38 15.63 27.62 0.25 27.48, 27.89 
Anagnostou et al., 2012 29725 276 45.2 171.60 8.1 34.51 2288 2130 8.01 1.68 15.97 26.32 0.25 

Rae - this study NBP0805_DR40_DCA1 1323 -69.73 -68.93 2.13 34.67 2347 2259 7.87 0.91 14.86 25.91 0.23 25.71, 25.95, 26.06 
Rae - this study NBP0805_DR35_DCA1 695 -59.72 -68.88 2.45 34.46 2320 2237 7.88 1.00 14.72 26.00 0.23 25.85, 26.16 


This table is also available as a spreadsheet in the Supplementary Information and on Pangaea. Latitude, longitude, and depth values are given as averages where samples were collected by dredge. 
DIC in the McCulloch et al. (2011)55 sample set has been calculated from reported ALK and pH. Carbonate system parameters in the Anagnostou et al. (2011)5° sample set are as re-calculated in 
Stewart et al. (2016)*4, using GLODAP ALK and DIC. Carbonate system parameters for new data reported here are calculated from GLODAPv2 ALK and DIC. Coral subsamples represent different solid 
pieces/powder subsampled from the same specimen. Note that Anagnostou et al. (2011)°° give a weighted average of these values, which is reported in the 6!!Beorai column. 
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Social regulation of a rudimentary organ generates 
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The origin of complex worker-caste systems in ants perplexed 
Darwin! and has remained an enduring problem for evolutionary 
and developmental biology”~®. Ants originated approximately 
150 million years ago, and produce colonies with winged queen and 
male castes as well as a wingless worker caste’. In the hyperdiverse 
genus Pheidole, the wingless worker caste has evolved into two 
morphologically distinct subcastes—small-headed minor workers 
and large-headed soldiers®. The wings of queens and males develop 
from populations of cells in larvae that are called wing imaginal 
discs’. Although minor workers and soldiers are wingless, vestiges or 
rudiments of wing imaginal discs appear transiently during soldier 
development”?""!. Such rudimentary traits are phylogenetically 
widespread and are primarily used as evidence of common descent, 
yet their functional importance remains equivocal’!*-!*, Here we 
show that the growth of rudimentary wing discs is necessary for 
regulating allometr y—disproportionate scaling—between head and 
body size to generate large-headed soldiers in the genus Pheidole. We 
also show that Pheidole colonies have evolved the capacity to socially 
regulate the growth of rudimentary wing discs to control worker 
subcaste determination, which allows these colonies to maintain the 
ratio of minor workers to soldiers. Finally, we provide comparative 
and experimental evidence that suggests that rudimentary wing 
discs have facilitated the parallel evolution of complex worker- 
caste systems across the ants. More generally, rudimentary organs 
may unexpectedly acquire novel regulatory functions during 
development to facilitate adaptive evolution. 

The evolution of large-headed soldiers is thought to have promoted 
the adaptive radiation of Pheidole, which has produced approximately 
1,000 species worldwide®!>. Pheidole soldiers are larger than minor 
workers and their heads are disproportionally larger than their bodies, 
forming an allometric line that is distinct from that of minor workers 
(Fig. lac). A developmental switch—largely controlled by nutrition® 
and mediated by juvenile hormone'°—determines whether larvae 
develop into minor workers or soldiers (Fig. 2a). Minor worker larvae 
lack wing rudiments (Fig. 2c), whereas soldier larvae have one pair 
of rudimentary forewing discs® (Fig. 2d). In Pheidole obtusospinosa, 
which has evolved a disproportionately larger ‘supersoldier’ subcaste 
(Fig. 1d-f), supersoldier larvae have two pairs of large rudimentary 
wing discs’ (Fig. 1g-i). In addition, using Pagel’s!” phylogenetic 
correlation method, we found that the size of rudimentary wing discs 
varies discretely between worker subcastes within ant species that 
have independently evolved a soldier subcaste (Extended Data Fig. 1 
and Extended Data Table 1). Finally, rudimentary forewing discs in 
Pheidole soldier larvae are coordinated in their growth, gene expres- 
sion and apoptosis—just after larvae become determined as soldiers 
(Fig. 2a, red arrowhead), rudimentary forewing discs appear’, grow 
rapidly? (Extended Data Fig. 2), activate expression of genes in the wing 
network”? and are finally eliminated by apoptosis''. These findings 
raise the possibility that rudimentary wing discs have a functional role 
in the development of soldiers. 


We tested this possibility in Pheidole hyatti by targeting vestigial (vg), 
which in Drosophila is a selector gene that coordinates growth and 
patterning of wing imaginal discs and is necessary and sufficient for 
wing development'®”. Spatial expression of vgis similar in P. hyatti 
and Drosophila—in embryos vg is expressed in wing primordia and the 
ventral nerve cord, but in larvae vg expression could be detected only in 
the wing discs of winged castes and in the rudimentary forewing discs 
of soldiers (Fig. 2b—d and Extended Data Fig. 3a-o). We therefore used 
RNA-mediated interference (RNAi) to knockdown vg expression in 
soldier-destined larvae (Fig. 2a, red arrowhead). Compared to controls, 
vg RNAi significantly reduces the size of rudimentary forewing discs 
(Fig. 2e-g) and, as in Drosophila wing discs, perturbing vg expression 
induces apoptosis (Extended Data Fig. 4). We discovered that reducing 
the size of rudimentary forewing discs affects the adult phenotype: 
it significantly reduces the head/body ratio and changes the head-to-body 
slope such that head size is reduced more than body size, relative to 
controls (Fig. 2h, iand Extended Data Fig. 5a-f). These ants vary widely 
in head and body size, from being as small as minor workers to being 
as large as soldiers—some ants are intermediate in size, which are var- 
iants that do not exist in nature (Fig. 2i and Extended Data Fig. 5g-I). 
Finally, vg has also been shown to be expressed outside of wing discs 
in Drosophila larvae*'. To confirm that vg RNAi changes head-to-body 
scaling by specifically reducing the size of rudimentary forewing discs, 
we electro-surgically ablated the left rudimentary forewing disc in 
soldier-destined larvae; as a control, we ablated the third leg disc on the 
left side (Fig. 2a, red arrowhead and Extended Data Fig. 5m-q). These 
ablations produced results similar to vg RNAi (Fig. 2j, k and Extended 
Data Fig. 5r-t). Together, these experiments show that rudimentary 
forewing discs are necessary to regulate the size and disproportionate 
head-to-body scaling that generates large-headed soldiers. 

Minor worker larvae lack rudimentary wing discs and do not express 
vg in any other imaginal discs (Fig. 2c and Extended Data Fig. 3e, h, k, n). 
However, we found low levels of expression of vg outside of imaginal 
discs in minor worker larvae (Extended Data Fig. 3p). To test whether 
this expression influences head-to-body scaling, we treated bipoten- 
tial larvae (Fig. 2a, orange arrowhead)—of which 90-95% develop 
into minor workers, and 5-10% into soldiers!®—with vg RNAi. The 
bipotential larvae treated with vg RNAi that became minor workers 
did not show significant differences in size, head/body ratio, slope 
or intercept compared to controls (Extended Data Fig. 6a, b, e, g-i), 
whereas the few that developed into soldier-destined larvae became 
intermediates (Extended Data Fig. 6c, d, f). Therefore, expression of 
vg outside of imaginal discs does not influence size or head-to-body 
scaling in the minor-worker subcaste. We then investigated whether 
wing discs in winged castes regulate head-to-body scaling. Relative to 
controls, we found that in larvae destined to become males (Fig. 2a, 
green arrowhead) vg RNAi causes defects in patterning and growth 
of adult fore- and hindwings (Extended Data Fig. 6j-l), but has no 
significant effect on size, head/body ratio, slope or intercept (Extended 
Data Fig. 6m-p). Therefore, wing discs in male larvae produce wings 
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Fig. 1 | Head-to-body allometry and rudimentary wing discs of Pheidole 
worker subcastes. a, P. hyatti minor worker adult (left) and soldier adult 
(right). b, P hyatti minor worker pupa (left) and soldier pupa (right). 

c, log-log plot of head-to-body allometry of wild-type minor workers 
(n=155) and soldiers (n= 80) (x and y axes in pm). d-i, Comparing 

P. obtusospinosa worker subcastes and rudimentary wing discs 
(comparisons to scale): minor worker (MW, d, g), soldier (SD, e, h) and 
supersoldier (XSD, f, i). Arrowheads indicate rudimentary wing discs; 
asterisk indicates absence of a rudimentary wing disc. 


but do not regulate size or head-to-body scaling. These results show 
that, within the wingless female worker caste, rudimentary forewing 
discs regulate the size and disproportionate head-to-body scaling that 
is specific to the soldier subcaste. 

Pheidole colonies dynamically maintain a ratio of soldiers (5-10%) 
to minor workers (90-95%): soldiers use their large heads for defence 
and food processing, and minor workers forage and nurse young***-*». 
This ratio is maintained through the balance of two types of social 
regulation: (1) the activation of soldier development by minor workers 
through nutrition®, which is mediated by juvenile hormone’®; and (2) 
the suppression of soldier development through an inhibitory phero- 
mone when the number of soldiers increases to above 5-10%”**. This 
pheromone is contact-based and is thought to be composed of cuticular 
hydrocarbons”**”*, Because juvenile hormone induces rudimen- 
tary forewing disc growth in P. hyatti'®, we investigated whether their 
growth is also influenced by the inhibitory pheromone. We therefore 
raised soldier-destined larvae (Fig. 2a, red arrowhead) with either 
100% soldiers (high inhibition) or 100% minor workers (no inhibi- 
tion, as a control). We discovered that raising soldier-destined larvae 
with 100% soldiers significantly reduces the size of rudimentary fore- 
wing discs, relative to controls (Fig. 3a—c). This produced ants with a 
significantly reduced head/body ratio and changes the head-to-body 
slope such that head size is reduced more than body size, relative to 
controls (Fig. 3d, e and Extended Data Fig. 7a-f). These ants vary 
widely in size, ranging from minor workers to soldiers, and include 
intermediates (Fig. 3e and Extended Data Fig. 7g—-k). Finally, to rule 
out the possibility that these results are caused by any potential differ- 
ences in the rearing environments of minor workers and soldiers, we 
treated soldier-destined larvae with cuticular hydrocarbons extracted 
from soldiers and compared them to solvent-only controls under 
identical rearing conditions (100% minor workers). This produced 
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Fig. 2 | Rudimentary forewing discs regulate size and disproportionate 
head-to-body scaling of soldiers. a, Caste determination in Pheidole 

at three developmental switch points produces: winged males, winged 
queens, wingless minor workers and wingless soldiers. Points of 
experimental manipulation are indicated by coloured arrowheads: red, 
soldier-destined larvae; orange, bipotential larvae; green, male-destined 
larvae. JH, juvenile hormone. b-d, vg expression (purple) in larval wing 
discs of males or queens (b), minor workers (c) and soldiers (d). Black 
asterisk, absence of rudimentary wing disc. Scale bars, relative scale. 
e-f, Rudimentary forewing discs after yfp RNAi (control RNAi) (e) 

or vg RNAi (f). Rudimentary wing disc presence (white arrowheads) or 
absence (white asterisks). g, Comparing ratio of log(rudimentary 
forewing disc area (jum7)) to log(leg disc area (\tm*)) (log(WD (\1m?))/ 
log(LD (\1m?))) between control RNAi (n = 13) and vg RNAi (n= 16). 
The box plot shows mean (+), interquartile range (bars) and minimum 
to maximum values (whiskers); all points represent individual ants. 
Two-tailed Mann-Whitney U-test, U= 24, ***P= 0.0002. h, Wild-type 
minor worker and representative individuals treated with control RNAi 
or vg RNAi. i, Comparing slopes of control RNAi (n = 23) and vg RNAi 
(n=35); analysis of covariance (ANCOVA), F = 38.1, degrees of 
freedom (d.f.) =54, P< 0.0001. Experiments were repeated at least three 
times. j, Wild-type minor worker and treated individuals with either 
rudimentary forewing disc or leg disc ablated. Red asterisk, ablated leg. 
k, Comparing slopes of leg disc (n = 16) and rudimentary forewing 

disc ablations (n = 16); ANCOVA, F= 8.74, d.f. = 28, P=0.0063. Image 
comparisons are to scale. Experiments were repeated at least twice. All 
regressions are x axis (log(body length (1m)) versus y axis (log(head 
width (jum)). 
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Fig. 3 | Social inhibition regulates size and disproportionate head- 
to-body scaling by suppressing growth of rudimentary forewing 

discs. Manipulations are on soldier-destined larvae. Rudimentary wing 
disc presence (arrowheads) or absence (asterisks). a, b, Comparing 
rudimentary wing disc size after no inhibition (100% minor workers) 

(a) and high inhibition (100% soldiers) (b). c, Comparing ratios of 
log(rudimentary forewing disc area (j1m*)) to log(leg disc area (\1m7)) 
between 100% minor workers (n = 18) and 100% soldiers (n = 23). The 
box plot shows mean (+), interquartile range (bars) and minimum to 
maximum values (whiskers); all points represent individual ants. Two- 
tailed unequal variance t-test, t=5.77, d.f. = 36.13, ****P < 0.0001. 

d, Wild-type minor worker and representative individuals raised by 
100% minor workers or 100% soldiers. e, Comparing slopes of 100% 
minor workers (n = 24) and 100% soldiers (n = 35); ANCOVA, F= 36.55, 
d.f.=55, P< 0.0001; x axis (log(body length (j1m)) versus y axis (log(head 
width (j1m)). Image comparisons are to scale. Experiments were repeated 
at least three times. 
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results similar to the larvae raised entirely by soldiers (Extended Data 
Fig. 7l-p). Taken together, the soldier inhibitory pheromone regulates 
size and disproportionate head-to-body scaling by suppressing growth 
of rudimentary forewing discs. 

We next investigated whether rudimentary forewing discs medi- 
ate the interaction between the inhibitory pheromone and juvenile 
hormone to regulate size, head-to-body scaling and worker subcaste 
determination. Previous work” has demonstrated that the inhibitory 
pheromone acts downstream of juvenile hormone and can inhibit acti- 
vation of soldier development—bipotential larvae treated with juvenile 
hormone and raised entirely by soldiers can block soldier development, 
which results in the production of large minor workers”. In P. hyatti, 
we found that these large minor workers develop from larvae that lack 
rudimentary forewing discs and are proportionally larger; although the 
intercept is significantly different relative to controls, the head-to-body 
slope is not (Extended Data Fig. 8i-1). Conversely, treating bipotential 
larvae with juvenile hormone but without inhibitory pheromone (100% 
minor workers) produced soldiers that crossed the juvenile-hormone 
threshold, as well as large minor workers that did not (Extended Data 
Fig. 8a—d). These large minor workers, which develop from larvae 
with rudimentary forewing discs that prematurely stop growing, are 
disproportionately larger; the head-to-body slope is significantly dif- 
ferent compared to controls (Extended Data Fig. 8e-h). These results 
show that: (1) blocking growth of rudimentary forewing discs with 
inhibitory pheromone blocks the disproportionate head-to-body scaling 
that is induced by juvenile hormone; and (2) juvenile hormone 
regulates size and proportional head-to-body scaling independently 
of rudimentary forewing discs (Fig. 4a). Finally, to determine the role 
of inhibitory pheromone independent of rudimentary forewing discs, 
we exposed bipotential larvae to inhibitory pheromone (100% soldiers) 
without treatment with juvenile hormone. The minor workers that 
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Fig. 4 | The role of rudimentary wing discs in the social regulation, 
development and evolution of complex worker-caste systems in ants. 

a, Interactions (arrows and lines) may be direct or indirect. Green arrows, 
activation; dashed green arrows, potential pathways to disproportionate 
scaling; red arrows and lines, inhibition; grey arrows, social interactions; 
grey circle, rudimentary forewing discs; grey boxes, wing gene network. 

b, Experimental manipulations (red) and controls (black) in P hyatti. 

c, Wild-type C. floridanus workers”? (n= 179). d, Wild-type S. geminata 
workers”? (n = 239). Plots in b-d show log(body length (jum)) or log(scape 
length (j1m)) on the x axis, versus log(head width (j1m)) on the y axis. 

f, Comparison, to scale, of wild-type minor worker, large minor worker 
anomaly (XMW), wild-type soldier and supersoldier-like anomaly (XSD). 


result develop from larvae that lack rudimentary forewing discs and 
are proportionally smaller; although the intercept is significantly dif- 
ferent, the head-to-body slope is not (Extended Data Fig. 8m-p). Taken 
together, we show that juvenile hormone and the inhibitory pheromone 
regulate size and proportional head-to-body scaling independently of 
rudimentary forewing discs (Fig. 4a). By contrast, juvenile hormone 
and the inhibitory pheromone regulate size, disproportionate head-to- 
body scaling, and soldier subcaste determination, by directly or indi- 
rectly influencing the growth of rudimentary forewing discs (Fig. 4a). 

Multiple molecular mechanisms may underlie the function of rudi- 
mentary wing discs. Organs, including wing imaginal discs in insects, 
communicate through conserved signalling pathways to coordinate 
development?’. In flies, butterflies and beetles, eliminating or reduc- 
ing the growth of one imaginal disc either increases the size of others 
owing to resource competition, or has no adult phenotypic effect owing 
to homeostasis”®. By contrast, we show that eliminating or reducing 
the growth of a rudimentary wing disc decreases the head/body ratio. 
The rudimentary forewing discs may increase head growth second- 
arily, through their influence on body growth. Alternatively, because 
rudimentary forewing discs increase head size more than body size, 
they may increase head and body growth independently of each other 
(Fig. 4a, dashed lines). Therefore, this ancient communication system 
may have been exploited and modified in Pheidole, enabling rudimen- 
tary wing discs to regulate head-to-body allometry at the level of the 
individual and the subcaste ratio at the colony level in response to envi- 
ronmental variation and challenges**-”° (Fig. 4a). 

Wilson proposed? that worker-caste systems in ants—which range 
from being monomorphic to completely dimorphic—have evolved 
through a series of developmental transitions in allometry. Our exper- 
imental manipulations of the growth of rudimentary forewing discs 
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produced intermediate variants that fill the empty phenotypic space 
between minor workers and soldiers (compare Fig. 1c to Fig. 4b). These 
manipulations transform P hyatti from a completely dimorphic worker- 
caste system to a sigmoid-like allometry, which mimics the worker- 
caste system of other species such as Camponotus floridanus”® or 
Solenopsis geminata*® (Fig. 4b-d). Furthermore, we discovered rare and 
anomalously large minor workers in a wild P. hyatti colony (Fig. 4e), 
which suggests that the intermediates we produced experimentally 
can appear in nature but that selection for a robust juvenile-hormone- 
mediated threshold between minor workers and soldiers may limit 
their production. We also discovered rare supersoldier-like anomalies 
ina wild P. hyatti colony (Fig. 4e). Because an ancestral developmental 
potential to produce supersoldiers has been retained across Pheidole, 
such anomalies can be experimentally induced with juvenile hormone”. 
Supersoldier anomalies develop from larvae with two pairs of rudimen- 
tary wing discs'°, which mimics the development of supersoldiers in 
P. obtusospinosa’” (Fig. 1f, i). These findings show that rudimentary 
wing discs store an ancestral developmental potential that—when 
released—produces allometric variation that mimics the worker-caste 
systems of other species (Fig. 4b-d). This potential may facilitate 
‘Wilson’s transitions’ between different complex worker-caste systems, 
and their parallel evolution across the ants (Extended Data Fig. 1). 

The transient appearance of rudimentary organs is a general feature 
of organismal development, and is primarily used to infer common 
ancestry’'?"'*. Although the functional importance of rudimentary 
organs is enigmatic’!?"!*, we show that rudimentary wing discs have 
a major regulatory role during ant development and evolution. We 
propose that rudimentary organs may generally evolve key regulatory 
functions and, through their capacity to store and release ancestral 
developmental potential, may be an underappreciated source of varia- 
tion that fuels adaptive evolution. 
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METHODS 


Animal collection and culturing. Colonies of P. hyatti and P. obtusospinosa were 
collected in the southwest of Arizona, USA. C. floridanus colonies were collected 
as newly mated queens in Tallahassee, Florida, USA. Monomorium trageri colonies 
and S. geminata larvae were collected in Gainesville, Florida, USA. All colonies 
and experimental replicates were kept at 27°C with 60% humidity and a 12:12-h 
light:dark cycle. They were kept in fluon-coated plastic boxes with cotton- 
constrained glass water tubes as well as sugar water tubes. They were fed 
mealworms and Bhatkar-Whitcomb diet*!. Holes were inserted in the lids of the 
boxes and lined with mesh to allow better air exchange. All experiments were 
performed in accordance with invertebrate care guidelines and regulations. 
Testing for phylogenetic correlation. Pagel’s!? maximum likelihood based 
phylogenetic correlation test, as implemented in Mesquite 3.5°, was used to test 
for a phylogenetic correlation between the evolution of inter-subcaste variation in 
size of rudimentary wing discs and evolution of a soldier subcaste. One hundred 
extra iterations were used to set the intensity of the likelihood search. Pagel’s!” 
method first calculates the log-likelihood that the presence of inter-subcaste 
variation in the size of rudimentary wing discs and the presence of soldier subcastes 
evolved independently of each other along the phylogeny, and then calculates the 
log-likelihood that both traits evolved in a correlated manner along the phylogeny. 
It then uses a likelihood ratio test statistic (1,000 simulations of the data were used) 
to determine which of these two models best fits the data. For data on the size of 
rudimentary wing discs and soldier subcastes, all available comparative data from 
the literature and from the present study were collected, resulting in data points for 
21 species across 10 ant genera (Extended Data Fig. 1 and Extended Data Table 1). 
Phylogenetic relationships and branch-length information were obtained for all 
21 species (Extended Data Fig. 1a) from previous publications**“, and provide 
formal phylogenetic evidence that a soldier subcaste has evolved several times inde- 
pendently across the ants. For each species, the presence of inter-subcaste variation 
in size of rudimentary wing discs was coded as a ‘1’ if rudimentary wing discs were 
discretely larger in the soldier subcaste compared to the minor-worker subcaste 
(Extended Data Table 1). Inter-subcaste variation in size of rudimentary wing discs 
was coded as ‘0’ if inter-subcaste variation was absent and larvae in the colony 
had similar sizes of rudimentary wing discs (Extended Data Table 1). For each 
species, the presence of a soldier subcaste was coded as ‘1’ if it was demonstrated 
by previous studies (Extended Data Table 1). The absence of a soldier subcaste was 
coded as ‘0’ if it was demonstrated by previous studies (Extended Data Table 1). 
Isolation of vestigial homologues. Forward: 5'-TATCCTTACCTKTAYC 
ARACCC-3’ and reverse: 5/-GCTGACTATTCCAAAAGGARGG-3’ degenerate 
primers were designed based on a vestigial sequence alignment of Tribolium 
castaneum (XM_008201106), Apis mellifera (XM_001122002.4) and several ant 
species obtained from the Ant Genome Portal database (http://hymenopteragenome. 
org/ant_genomes/). P hyatti RNA was isolated using TRIzol (Invitrogen) from 
a pool of embryos and larvae of different developmental stages. RNA was then 
reverse-transcribed to synthesize a cDNA library. PCR amplicons were ligated into 
the pGEM-T easy vector (Promega) and subsequently sequenced (MH683613) 
using Sanger sequencing at the Genome Quebec Innovation Centre at McGill 
University. To determine whether these vestigial sequences—including the 
TONDU/vestigial domain—are conserved, amino acid alignment was performed 
using Geneious Alignment on Geneious (R8), followed by manual alignment 
(Supplementary Fig. 2). 

Whole-mount in situ hybridization. Terminal stage embryos and larvae were 
fixed and processed as previously described”*!~“9, Using a Zeiss Discovery V12 
stereomicroscope, fixed samples were then dissected to remove the gut and fat 
tissue to facilitate probe access to imaginal tissues. Digoxigenin-labelled riboprobes 
(Roche Diagnostics Canada) for vestigial and wingless’ were then synthesized from 
the cloned P. hyatti fragments and subsequently used for spatial gene expression 
profiling in embryos” and larvae**. Reproductive, soldier and minor worker larvae 
were pooled during in situ hybridization. 

Semi-quantitative reverse-transcription PCR of vestigial transcripts in 
P. hyatti. vestigial expression in wild-type terminal P. hyatti larvae was deter- 
mined through the detection of vestigial transcripts using semi-quantitative PCR 
with reverse transcription (Extended Data Fig. 3 and Supplementary Fig. 1). The 
housekeeping gene Elongation factor 1 alpha (EF 1a) was used as our reference 
gene because it has previously been validated as a reference gene for quantita- 
tive PCR with reverse transcription in social insects*’. Therefore, EF1a was first 
cloned and sequenced (MH683615) from a P. hyatti cDNA library using the 
following degenerate primers: forward 5’-GATTCYGGCAAGTCGACCA-3’ 
and reverse 5'-GGAACTCTTGGAAAGCCTCAAC-3’. The PCR product was 
sequenced using Sanger sequencing at the Genome Quebec Innovation Centre 
at McGill University. To extract RNA, three minor worker larvae and three sol- 
dier larvae at the terminal stage were collected from a laboratory colony and 
total RNA was extracted from minor workers and soldiers separately. The tissue 
was disrupted using a TissueLyser (Qiagen) bead mill and RNA was extracted 


using the TRIzol (Invitrogen) RNA extraction protocol“, and then purified using 
the RNeasy Plus Kit (Qiagen). Minor worker and soldier RNA was treated with 
DNase I (Invitrogen) to remove genomic DNA before being reverse transcribed 
into cDNA using the Superscript II First-Strand Synthesis System (Invitrogen). 
The concentrations of total RNA and total cDNA were normalized between 
minor workers and soldiers before cDNA synthesis and PCR, respectively. 
The two cDNA libraries were used as PCR templates for the semi-quantitative 
PCR with reverse transcription of vestigial and EFla. The P. hyatti vestigial PCR 
primers used were: forward 5’-TCCTTACCTGTATCAGACCCATC-3’ and 
reverse 5‘-TGTCGATCTGTCGTCGTCCAA-3’, and the P. hyatti EFla PCR 
primers used were: forward 5’-TCAGGACGTGTACAAGATC-3’ and reverse 
5/-CAATGACCTGTGCAGTAAAG-3’. The PCR was performed using an annealing 
temperature of 56°C with 31 thermocycles and four serial dilutions of the two 
cDNA libraries (500 ng, 250 ng, 125 ng and 62.5 ng) were used for the semi- 
quantitative PCR with reverse transcription. A no-template water control was used 
to confirm absence of contamination. 

Phospho-histone H3 (PH3) assay. To assay for proliferation, rudimentary wing 
imaginal discs from soldier-destined larvae measuring 2.2 mm, 2.5 mm and 
2.7 mm in size were fixed and dissected (as described in ‘Whole-mount in situ 
hybridization’), and then immunohistochemistry against PH3 was performed as 
previously described*’. Mouse monoclonal anti-PH3 (PH3 Ser10, Cell Signaling 
Technology, 9706S) primary antibody was used at a 1:25 dilution and the second- 
ary antibody, goat anti-mouse Alexa 555 (AbCam, AB150114) was used at a 1:500 
dilution. Samples were counter-stained with DAPI. 

RNAi. To functionally knock down vestigial, vg double-stranded RNA 
(dsRNA) was synthesized from the same plasmid from which the vg in situ 
riboprobe was synthesized. The fragment (not including primer regions) 
was 631 bp, which is of a size range that is known to be highly efficient in 
targeting imaginal discs of other insects**. T7-flanked (uppercase) vg forward 
5'-TAATACGACTCACTATAGGGtatccttacctgtaccagaccc-3’ and reverse 
5’-TAATACGACTCACTATAGGGegctgactattccaaaaggaggg-3’ primers were 
used to amplify a template for T7 RNA polymerase to synthesize the dsRNA*”””. 
dsRNA was then purified with the Qiagen RNAeasy purification kit followed by 
ethanol precipitation and eluted in Spradling injection buffer?! to a concentration 
of 3.5 mg/ml. 

Size-matched soldier-destined larvae were then injected with vestigial dsRNA 
or with yfp dsRNA or injection buffer as a control. Soldier-destined larvae can 
be defined as larvae that ranged from 2.2 to 2.75 mm, and these larvae possess a 
characteristic brown gut”’. Larvae at this stage were specifically injected, because 
this is when the rudimentary forewing imaginal discs appear and begin to grow 
in Pheidole’ and at this stage vg expression in the wing discs of Drosophila is 
required'*®. To test whether vg RNAi affects head-to-body scaling independently of 
the wing disc, size-matched bipotential larvae—which are defined as ranging from 
1.0 to 1.6 mm'°—were injected with vg dsRNA because 95% of bipotential larvae 
develop into minor worker larvae that lack rudimentary wing discs. Finally, to 
determine the effect of vg RNAi on wing imaginal discs that develop into functional 
wings, size-matched male-destined larvae were injected, and wing phenotypes 
were examined and head width and body size (Weber's length) were measured. 

To inject soldier-destined, bipotential and male larvae, the larvae were placed 
onto two-sided tape on a microscope slide and stabilized along a glass capillary 
tube. Needles were made using a Sutter Instrument needle puller (Model P-97) 
and inserted onto a Narishige microinjection apparatus attached to a Zeiss 
Discovery V8 dissection microscope fitted with a custom-made x-y movable 
platform. Microinjection flow pressure was controlled using a Cell Tram Vario 
5176 (Eppendorf). Larvae were injected, while on their dorsum, through their 
lateral side at the midline of the antero-posterior axis. After the injection, larvae 
were placed in replicates containing adult minor workers to care for them (in a 2:1 
adult:larva ratio) until they were either fixed before pupation and DAPI-stained 
for imaginal disc imaging and measurements, left to metamorphose into pupae 
for pupal imaging and measurements or—in the case of males—left to become 
adults for measurements. 

Efficiency and specificity of vg RNAi. Semi-quantitative reverse-transcription 
PCR was performed on terminal P hyatti soldier larvae after injection with vg 
RNAi or yfp RNAi to confirm knockdown of vg transcripts (Extended Data 
Fig. 4 and Supplementary Fig. 1). RNA was extracted from terminal soldier 
larvae 48 h post-injection and cDNA libraries were synthesized and standard- 
ized as described above. NADH (NADH-ubiquinone oxidoreductase subunit 8) 
was used as a reference gene as it has previously been validated as a reference 
gene for quantitative PCR in ants®**. Therefore, NADH was first cloned and 
sequenced (MH683614) from a P. hyatti cDNA library using the following 
degenerate primers: forward 5’-GGGBCCTTTACAGATAATTGCRC-3’ 
and reverse 5/-ATTCRAAGTTRGGACCCTCA-3’. The two cDNA libraries 
(vg RNAi and yfp RNAi) were used as PCR templates for the semi-quantitative 
reverse-transcription PCR of vestigial and NADH. The NADH P. hyatti 
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primers were: forward 5'-GGGAGAGCATGCGTTAAGAA-3’ and reverse 
5!-TAGCCTGTGCAGGACAAATC-3’. The vestigial primers targeting outside 
the dsRNA fragment were: forward 5’-CATTACCCACAGTACCATCACA-3/ and 
reverse 5'-CAGCAGAAGGCCACTGTAG-3’. The PCR was performed using an 
annealing temperature of 60°C with 35 thermocycles and four serial dilutions of 
the two cDNA libraries were used as template. A no-template water control was 
used to confirm absence of contamination. 

Finally, the TUNEL (TdT-mediated dUTP nick end labelling) assay was used to 
validate the specificity of vg RNAi because perturbation of vg expression is known 
to induce apoptosis in Drosophila wing imaginal discs. Apoptotic cell death was 
assayed using the In Situ Cell Death Detection Kit, AP (Roche Diagnostics). The 
protocol was carried out as in a previous publication®, with a modification of the 
proteinase K step to 5 min (50 g/ml). 

Ablations. The ablation protocol was modified from a previous publication™. 
Soldier-destined larvae were collected, as described in ‘RNAi. Leg and rudimen- 
tary wing disc ablations were done using a Hyfrecator 2000 electrosurgical unit 
(ConMed) with an electrochemically sharpened (1 M NaOH and 120 V) tungsten 
wire. The tungsten wire was fastened and wound around an extra-fine needle 
electrode (714, ConMed) to enable current to pass through the sharpened wire. 
All ablations were performed using 1.5 W of electrical current for 0.5-1 s. During 
cauterization, larvae were placed on their dorsum on a KimWipe moistened with 
PBS under a Zeiss Discovery V8 dissection microscope to enable visualization of 
the leg discs. The left rudimentary wing disc was cauterized using the position of 
the leg discs and boundaries of the thoracic segments as landmarks. To control for 
the effect of cauterization and for damage to imaginal tissue, the left posterior leg 
imaginal disc was cauterized using the same intensity and time of cauterization. 
After cauterization, larvae were placed in replicates containing adult minor workers 
to care for them (in a 2:1 adult:larva ratio) until they were either fixed before 
pupation and DAPI-stained for imaginal disc imaging or left to metamorphose into 
pupae for pupal imaging and measurements. Analysing the effect of control leg disc 
ablations on pupae and dissected larvae allowed us to confirm that cauterization 
was specific to the targeted tissue and did not damage surrounding imaginal tissues 
(Fig. 2) and Extended Data Fig. 5m-q). 

Soldier inhibition and juvenile hormone manipulations. Previous studies have 
shown that exposure of Pheidole larvae to a high density of adult soldiers reduces 
their potential to develop into soldiers owing to the production of a soldier inhib- 
itory pheromone”. In particular, bipotential larvae that were treated with a dose 
of the juvenile hormone analogue methoprene—which would normally induce 
soldier development—became minor workers when raised by 100% soldiers”*. In 
the absence of adult soldiers, treating size-matched bipotential larvae with methop- 
rene has previously been shown to induce soldier and supersoldier development in 
Pheidole and results in the development of rudimentary wing discs'®. To determine 
whether the inhibition of soldier development is mediated by rudimentary wing disc 
growth, sized-matched bipotential larvae were treated with methoprene (5 mg/ml, 
Sigma-Aldrich) or acetone vehicle control and raised by either 100% minor 
workers (no inhibition) or 100% soldiers (high inhibition) in a 2:1 adult:larva 
ratio. Larvae were either fixed before pupation for imaginal disc imaging or left to 
metamorphose into pupae for pupal imaging and measurements. To determine 
the effects of this inhibitory pheromone on soldier development, soldier-destined 
larvae were collected as described in ‘RNAi, and placed in replicates that contained 
100% adult soldiers (high inhibition) or 100% minor workers (no inhibition) in 
a 2:1 adult:larva ratio. Larvae were fixed before pupation and DAPI-stained for 
imaginal disc measurements or left to metamorphose into pupae for pupal imaging 
and measurements. To determine whether the inhibitory pheromone is involved 
in regulating head-to-body scaling independent of the disc, bipotential larvae— 
which do not possess rudimentary discs—were raised with 100% adult soldiers 
(in a 2:1 adult:larva ratio). These were additionally treated with acetone, to enable 
comparison to the acetone + 100% minor workers treatment described above. 
Individual ants were then either fixed as larvae for imaginal disc measurements 
or left to metamorphose for pupal measurements. 

Cuticular hydrocarbon extraction and application. To test whether the soldier 
inhibitory pheromone is capable of changing head-to-body scaling independent 
of rearing environment, soldier-destined larvae were treated with cuticular hydro- 
carbons extracted in a 2:1 adult:larva ratio from adult soldiers. The extraction 
and application protocol was modified from a previous publication®. To extract 
the surface cuticular hydrocarbons from adult soldiers, adult soldiers were frozen 
at —80°C for 3 min and then placed in a Teflon-capped borosilicate glass vial 
(Sigma-Aldrich) containing 50 j1l hexane per 10 soldiers for 2 min. Extracts were 
dried with high-purity nitrogen and resuspended in hexane. Soldier-destined larvae 
were treated with 1 jl of extract or hexane control and then placed in replicates 
containing adult minor workers to care for them (in a 2:1 adult:larva ratio). Larvae 
were left to metamorphose into pupae for pupal imaging and measurements. 
Microscopy and morphometrics. Larval and pupal imaging was conducted with 
a Zeiss Discovery V12 stereomicroscope, while the imaging of imaginal discs 
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(Bright Field, DIC and fluorescence) was performed with a Zeiss Axiolmager Z1 
microscope. Larval rudimentary wing discs stained for PH3 were imaged with 
a confocal Leica SP8 Point-Standing Confocal system on a Leica DMI6000B 
Inverted Microscope. Zeiss Axio Vision software was used to take larval, imaginal 
disc and pupal measurements. All rudimentary wing disc size measurements were 
performed on terminal stage larvae just before pupation, as this establishes a stand- 
ardized stage for comparisons between castes and experimental manipulations’. 
Furthermore, rudimentary wing disc size was compared to leg disc size to normalize 
for variation among individual ants within the terminal stage. Head width and 
body length of pupae was measured in a dorsal view because insects obtain their 
final adult shape and size after they metamorphose, and measurements on pupae 
are more repeatable and accurate than those on adults. 

Statistics and reproducibility. All statistical analyses were performed on Graphpad 
Prism v.7 and statistical analyses were considered statistically significant at a P 
value < 0.05. All measures were log-transformed before analysis, with the excep- 
tion of those used to calculate percentage change in head width and body length. 
To estimate measurement error, 20 wild-type minor worker and 20 wild-type 
soldier pupae were randomly selected and pupal head width and body length were 
measured. Each morphological trait was measured three times and the person 
taking the measures was blind to previous values. A random-effects ANOVA 
(RStudio, v.1.1.423) was used to estimate the measurement error in log-transformed 
values for head width and body length. The measurement error was found to 
make up a negligible percentage of total variance in head width (0.0599%) and 
body length (0.0822%). Therefore, ordinary least squares linear regression was 
used throughout because previous work*® has shown that the ordinary least 
squares method is more accurate than reduced major-axis regression when there 
is low measurement error in the variable on the x axis. No statistical methods 
were used to predetermine sample size, all experiments were independently 
repeated multiple times and converged on similar results as shown in the Figs. 
and Extended Data Figs. Blinding and randomization was used during all 
experimental manipulation. 

To test whether any of our manipulations (vg RNAi, ablations, high inhibition 
raised by 100% soldiers, soldier CHC extracts and/or juvenile hormone treatments) 
altered head-to-body scaling compared to their controls, ANCOVA was used to test 
for differences in slope and—if no change in slope was observed—differences in y 
intercept between linear regressions of treatment and the respective control were 
assessed. To compare the ratio of rudimentary wing disc/leg disc area, or the ratio 
of pupal head width/body length, head width, or body length, normality was first 
tested using the Shapiro- Wilk test. If normality was violated, a non-parametric 
Mann-Whitney U-test was used, and if not an unpaired t-test was used. Where 
necessary, a Welch's correction for unequal variances was used. All pairwise com- 
parisons were two-tailed except for (1) electrosurgical ablations; (2) percentage 
change in head width versus body length; and (3) cuticular hydrocarbon 
treatments, for which we used one-tailed tests based on a priori expectations 
derived from the results of our preliminary RNAi and soldier inhibition exper- 
iments. This workflow was used to determine whether vg RNAi, ablations, social 
and juvenile hormone manipulations, and CHC treatments differed compared to 
their respective controls for: (1) the ratio of average rudimentary wing disc area 
to average leg disc area; (2) the ratio of head width to body length; (3) head width; 
and (4) body length. For multiple pairwise comparisons of head width or body 
length between social and/or juvenile hormone manipulations and their controls, a 
Bonferroni correction was applied. Finally, Fisher’s exact test was used to determine 
whether the proportion of males with affected wing morphology differed between 
yfp RNAi control males and vg RNAi. 

Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability 
All relevant data are included in the paper. The raw data for all analyses used in this 
study are available from the corresponding author upon request. 
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Extended Data Fig. 1 | The phylogenetic correlation between 

presence of a soldier subcaste and presence of discrete inter-subcaste 
variation in rudimentary wing disc size. a, Pagel’s test (difference in log 
likelihoods = 8.43, P= 0.001) shows that there is a significant phylogenetic 
correlation between the presence of a soldier subcaste and the presence 
of discrete inter-subcaste variation in the size of rudimentary wing discs, 
for 21 species of ants. Grey lines indicate phylogenetic relationships, 

and the time scale of these relationships is indicated at the bottom in 
millions of years. The orange bar indicates presence of discrete inter- 
subcaste variation in the size of rudimentary wing discs, the green bar 
with two adjacent cartoons indicates presence of minor worker and 
soldier subcastes, and the dark green bar with three adjacent cartoons 
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indicates presence of minor worker, soldier and supersoldier subcastes. 
Inter-subcaste variation in the size of rudimentary wing discs in workers 
(W) and/or minor workers and/or soldiers and/or supersoldiers are at the 
tip of each branch, where the white-circle drawings represent presence 
and relative size, within species, of rudimentary wing discs and asterisks 
indicate absence of rudimentary wing discs. Extended Data Table 1 
provides references and descriptions. b-k, Adults and rudimentary wing 
discs of minor workers and/or soldiers and/or workers of C. floridanus 
(b-e), S. geminata (f-i) and M. trageri (j, k). d, e, h, i, k, Arrowheads 
indicate the presence of rudimentary wing discs and asterisks indicate 
the absence of rudimentary wing discs. All comparisons within species are 
to scale. 
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Extended Data Fig. 2 | Proliferation of rudimentary forewing discs in 
soldier-destined larvae. a—c, Immunohistochemistry of rudimentary the bottom right corner. All images are to scale. c’ provides an increased 
forewing discs in P. hyatti soldier-destined larvae. DAPI (magenta) stains magnification of c, to show nuclear co-localization of DAPI and PH3. 
all nuclei within the cells of rudimentary forewing discs, and phospho- Experiments were repeated twice. 

histone H3 (PH3; yellow) stains proliferating cells. The length of the 


larva from which the rudimentary forewing disc was dissected is given in 
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Extended Data Fig. 3 | Imaginal disc expression of vg is restricted to 
wing discs during larval development in P. hyatti. a~c, DAPI staining 
(a); wingless (wg) expression marking segment boundaries (b) and 
vestigial (vg) expression in the ventral nerve cord and in wing primordial 
cells in thoracic segments T2 and T3 (c) is shown for P. hyatti embryos. 
The three thoracic segments are labelled T1, T2 and T3. Experiments on 
embryos were repeated twice. d-f, Larval cartoons depicting leg discs 
and wing discs in queens (d), leg discs in minor workers (e) and leg discs 
and rudimentary wing discs in soldiers (f). Asterisks indicate absence 
of rudimentary wing discs. vg expression is indicated in purple. Lines at 
bottom right indicate the relative scale. g-o, vg expression is present in 
the larval (rudimentary) wing discs and is absent in the head and in leg 
discs of queens (n = 17) (g, j, m), minor workers (n = 20) (h, k, n) and 
soldiers (n = 11) (i, 1, 0). Black arrowheads indicate position of larval head 
and leg discs, and asterisks indicate absence of rudimentary wing discs. 
Experiments were repeated at least three times. p, Electrophoresis of PCR 
products obtained using vg and EF1a primers on wild-type soldier and 
minor-worker cDNA libraries, each constructed from three terminal-stage 
larvae. The 1 kb+ ladder is shown as reference and 1, 1/2, 1/4 and 1/8 
represent serial dilutions of template cDNA. vg transcript (left) is detected 
in both soldiers and minor workers using semi-quantitative reverse- 
transcription PCR. The housekeeping EF 1a transcript (right) is detected 
in both soldiers and minor workers, and levels are comparable between 
soldier and minor-worker cDNA libraries across dilutions. Negative 
controls (water with no template) show no contamination. For uncropped 
gel source data, see Supplementary Fig. 1. Experiments were repeated with 
independent biological replicates three times. 
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Extended Data Fig. 4 | vg RNAi reduces vg expression and induces 
apoptosis in rudimentary forewing discs. a, Electrophoresis of PCR 
products obtained using vg and NADH primers on soldier cDNA libraries 
each constructed from three terminal-stage larvae after vg RNAi or yfp 
RNAi (control RNAi) injection (Fig. 2a, red arrowhead). The 1 kb+ ladder 
is shown as reference and 1, 1/2, 1/4 and1/8 represent serial dilutions 
of template cDNA. vg transcript (left) is detected in both vg RNAi and 
control RNAi libraries using semi-quantitative reverse-transcription PCR. 
The housekeeping NADH transcript (right) is detected in both vg RNAi 
and control RNAi larvae, and levels are comparable between vg RNAi 
and control RNAi cDNA libraries across dilutions. Negative controls 
(water with no template) show no contamination. Experiments were 
repeated twice as independent biological replicates. For uncropped gel 
source data, see Supplementary Fig. 1. b, c, Apoptosis revealed by TUNEL 
assay in rudimentary forewing disc in control RNAi (b) and vg RNAi 
(c). Compared to control RNAi, vg RNAi induces apoptosis along the 
dorso-ventral margin of rudimentary forewing discs, where wild-type vg 
expression is normally strongest (b, c, black arrowhead). Experiments were 
repeated at least twice. 
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Extended Data Fig. 5 | Rudimentary forewing discs regulate size and 
disproportionate head-to-body scaling. a, Wild-type adult minor 
worker. b, vg RNAi intermediate adult. c, Wild-type soldier adult. Images 
to scale. d, Comparing ratios of log(head width (j1m)) to log(body length 
(um)) (log HW:log BL), between yfp RNAi (control RNAi, n = 23) and 

vg RNAi (n = 35); the box plot shows mean (+), interquartile range 

(bars), minimum-to-maximum values (whiskers); all points represent 
individual ants. Two-tailed Mann-Whitney U-test, U=219, **P= 0.0031. 
e, Percentage change in body length (1m) versus percentage change in 
head width (j1m) of yg RNAi compared to a 1:1 line. Each point represents 
(absolute( HW ot HWoontrol RNAi average)/H Weontrol RNAi average) x 100 and/or 
(absolute(BL = BLcontrol RNAi average)/BL control RNAi average) x 100. 

f, Comparing the percentage change in body length (j1m) and head width 
(um) after vg RNAi (n = 35). The box plot shows mean (+), interquartile 
range (bars), maximum-to-minimum values (whiskers); all points 
represent individual ants. One-tailed Mann-Whitney U-test, U= 470, 
*P= 0.0477. g, Wild-type minor worker. l, yfp RNAi (control RNAi) 
soldier. g-l, vg RNAi individuals showing a range of intermediates between 
minor worker and soldier (see Fig. 2i). Wild-type minor worker is shown 
for reference in a, g. All image comparisons are to scale. Experiments 
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were repeated at least three times. m-t, Electrosurgical ablation of leg 

and rudimentary forewing discs in soldier-destined larvae (Fig. 2a, red 
arrowhead). m, Wild-type soldier, leg and rudimentary forewing discs. 

n, Site of leg disc cauterization shown by melanized cuticle. 0, Ablation 

of leg disc (DAPI, red asterisk). p, Site of rudimentary forewing disc 
cauterization shown by melanized cuticle. q, Ablation of rudimentary 
forewing disc (DAPI, red asterisk). White or black arrowheads indicate 
the presence of rudimentary forewing discs. All images are to scale. 

r, Comparing log(head width (j1m)) between leg disc ablation (n = 16) and 
rudimentary forewing disc ablation (n = 16); one-tailed Mann-Whitney 
U-test, U= 82, *P = 0.0432. s, Comparing log(body length (j1m)) between 
leg disc ablation (n = 16) and rudimentary forewing disc ablation (n = 16). 
One-tailed unequal variance t-test, t= 1.77, d.f. =22.34,*P =0.045. 

t, Comparing ratio of log(head width (j1m)) to log(body length(j1m)), 
between leg disc ablation (n = 16) and rudimentary forewing disc ablation 
(n= 16). One-tailed unpaired t-test, t= 2.07, d.f.=30, *P=0.0234. 

The box plots in d, f, r-t show mean (+), interquartile range (bars) and 
minimum-to-maximum values (whiskers); all points represent individual 
ants. Experiments were repeated at least twice. 
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Extended Data Fig. 6 | The function of rudimentary forewing discs 

in regulating disproportionate head-to-body scaling is specific to 

the soldier subcaste. vg RNAi on bipotential larvae (Fig. 2a, orange 
arrowhead). a, yfp RNAi (control RNAi) minor worker. b, vg RNAi minor 
worker. c, vg RNAi intermediate. d, yfp RNAi (control RNAi) soldier. 
Image comparisons to scale. e, log(body length (j1m)) versus log(head 
width (j1m)), comparing minor worker head-to-body scaling of yfp 

RNAi (control RNAi, n= 69) and vg RNAi (n = 84). ANCOVA: slope, 
F=0.162, d.f.=149, P=0.69; y intercept, F= 0.4755, d.f.= 150, P=0.49). 
f, log(body length (j1m)) versus log(head width (j1m)), comparing head- 
to-body scaling of yfp RNAi (control RNAi; n = 6) and vg RNAi (n= 4) 
ants that initiated soldier development. ANCOVA: F = 7.44, d.f.=6, 
P=0.0343. g, Comparing log(head width (j1m)) between control RNAi 
(n= 69) and vg RNAi (n = 84) minor workers. Two-tailed unpaired 

t-test: f= 1.62, d.f.= 151, P=0.1074. h, Comparing log(body length (\1m)) 
between control RNAi (n= 69) and vg RNAi (n = 84) minor workers. 
Two-tailed unpaired t-test: t= 1.54, d.f. = 151, P=0.1249. i, Comparing 
ratios of log(head width (j1m)) to log(body length (j1m)) between 

control RNAi (n= 69) and vg RNAi (n= 84) minor workers. Two-tailed 
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unpaired t-test: t= 0.26, d.f.= 151, P= 0.80. Experiments were repeated 
at least three times. j-p, vg RNAi on male-destined larvae (Fig. 2a, green 
arrowhead). j, Bar graph showing percentage of individual ants affected 
(red) after yfp RNAi (control RNAi; n=0 out of 23) and vg RNAi (n=9 
out of 21). Two-tailed Fisher’s exact test: P= 0.0004. k, 1, Wings from yfp 
RNAi (control RNAi) (k) and vg RNAi (1) males. Image comparisons are 
to scale. m-o, Head-to-body scaling of yfp RNAi (control RNAi; n = 23) 
and vg RNAi males (n = 21), comparing log(head width (\1m)), two-tailed 
Mann-Whitney U-test, U= 222.5, P=0.6626 (m); log(body length (j1m)), 
two-tailed Mann-Whitney U-test, U= 198, P=0.3157 (n); and ratio 

of log(head width (j1m)) to log(body length (j1m)) (0), between control 
RNAi males and vg RNAi males, two-tailed unpaired t-test, t= 0.18, 
d.f.=42, P=0.86. p, log(body length (j1m)) versus log(head width (1m)) 
comparison of head-to-body scaling, between yfp RNAi (control RNAi; 
n= 23) and vg RNAi males (n =21). ANCOVA: slope, F = 3.111, d.f.=40, 
P=0.0854; y intercept, F=0.076, d.f.=41, P=0.7837. Experiments 

were repeated at least three times. The box plots in g-i and m-o show 
mean (+), interquartile range (bars) and minimum-to-maximum values 
(whiskers); all points represent individual ants. 
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Extended Data Fig. 7 | Soldier inhibitory pheromone regulates size 
and disproportionate head-to-body scaling. a-k, Effect of social inhibition 
on soldier-destined larvae (Fig. 2a, red arrowhead). a, Wild-type 

adult minor worker. b, Intermediate adult resulting from soldier- 

destined larvae being raised in colonies composed of 100% soldiers (high 
inhibition). c, Representative adult soldier resulting from soldier-destined 
larvae being raised in colonies composed of 100% minor workers (no 
inhibition). d, Comparing ratios of log(head width (1m)) to log(body 
length (j1m)) between 100% minor worker (n = 24) and 100% soldier 

(n= 35). The box plot shows mean (+), interquartile range (bars) and 
minimum-to-maximum values (whiskers); all points represent individual 
ants. Two-tailed Mann-Whitney U-test, U= 155, ****P < 0.0001. 

e, Percentage change in body length (1m) versus percentage change in head 
width (\1m) of 100% soldiers, compared to a 1:1 line. Each point represents 
(absolute( HW ~ HW 100% minor worker average)/HW 100% minor worker average) 

x 100 and/or (absolute(BL — BLioo% minor worker average)/BL 90% minor worker average) 
x 100. f, Comparing percentage change in body length (j1m) and head 


control SD CHCs 


width (jm) following high inhibition (100% soldiers; n = 35). One-tailed 
Mann-Whitney U-test, U=421, *P=0.0121. g, Wild-type minor worker. 
h-j, 100% soldier-raised ants showing a range of intermediates between 
minor workers and soldiers (see Fig. 3e). k, 100% minor-worker control 
soldier. All image comparisons are to scale. Experiments were repeated at 
least three times. l-p, Application of soldier cuticular hydrocarbon extract 
(CHCs) to soldier-destined larvae (Fig. 2a, red arrowhead). 1, Wild-type 
minor worker and individual ants treated with soldier CHCs and hexane 
solvent (control). m, Comparing slopes of hexane solvent control (n= 21) 
and soldier CHCs (n= 19). ANCOVA, F= 6.84, d.f. = 36, P=0.0129. 

n, log(head width (\1m)); one-tailed Mann-Whitney U-test, U= 114, 

** P= 0.0099. 0, log(body length (\1m)); one-tailed Mann-Whitney U-test, 
U=117, *P=0.0126. p, Ratio of log(head width (1m)) to log(body length 
(jum)); one-tailed Mann-Whitney U-test, U= 125, *P= 0.0221. Wild-type 
minor worker is shown for reference in a, g, 1. The box plots in d, f, n-p 
show mean (+), interquartile range (bars), minimum-to-maximum values 
(whiskers); all points represent individual ants. 
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Extended Data Fig. 8 | Juvenile hormone and inhibitory pheromone 
regulate disc-dependent disproportionate head-to-body scaling and 
disc-independent proportional head:body scaling. Effect of juvenile- 
hormone activation and social inhibition on bipotential larvae (Fig. 2a, 
orange arrowhead). a, Absence of rudimentary wing discs in minor worker 
larvae exposed to solvent control with no inhibition (‘acetone + 100% 
MW)). a, e, i, m, Arrowheads indicate the presence of rudimentary wing 
discs and asterisks indicate the absence of rudimentary wing discs. 

b, Plot of head-to-body scaling of acetone + 100% minor worker, and 
juvenile-hormone activation with no inhibition (JH + 100% MW’); the 
majority of larvae treated with juvenile hormone develop into soldiers, 
and some develop into large minor workers. c, d, Comparing between 
acetone + 100% minor worker (n= 7) and juvenile hormone + 100% 
minor worker (1 = 17) treatments of individuals that developed into 

the soldier size distribution. c, log(body length (\1m)); two-tailed 
unpaired t-test; t= 5.25, d.f. = 22, ****P < 0.0001. d, log(head width 
(jum)); two-tailed unpaired t-test, t= 3.50, d.f.=22, **P=0.002. 

e, Initiation of growth of rudimentary forewing discs in minor worker 
larvae exposed to juvenile hormone + 100% minor worker. f, Plot of 
head-to-body scaling between acetone + 100% minor worker (n = 114) 
and juvenile hormone + 100% minor worker (n = 29) of individuals that 
developed into the minor-worker size distribution. ANCOVA: F=7.12, 
d.f.= 139, P=0.0085. g, h, Comparing between acetone + 100% minor 
worker (= 114) and juvenile hormone + 100% minor worker (n= 29) 
treatments of individuals that developed into the minor-worker size 
distribution. g, log(body length (j1m)); two-tailed Mann-Whitney U-test, 
U= 463, ****P < 0.0001. h, log(head width (1m)); two-tailed unequal 
variance t-test, t=5.19, d.f.=32, ****P < 0.0001. i, Absence of growth 
of rudimentary wing discs in minor worker larvae exposed to juvenile- 


N 
@ 


acetone+ JH+ 
100% SD 100% SD 


acetone + acetone + 
100% MW 100% SD 


hormone activation with high inhibition (JH + 100% SD’). j, Plot of 
head-to-body scaling between solvent control with high inhibition 
(‘acetone + 100% SD’; n = 88) and juvenile hormone + 100% soldiers 
(n= 46) of individuals that developed into the minor-worker size 
distribution. ANCOVA: slope, F = 0.54, d.f.= 130, P=0.47; y intercept, 
F=27.2, d.f.=131, P< 0.0001. k, 1, Comparing between acetone + 100% 
soldiers (n = 88) and juvenile hormone + 100% soldiers (n = 46) 
treatments of individuals that developed into the minor-worker size 
distribution. k, log(body length (j1m)); two-tailed unequal variance 
t-test, t= 4.43, d.f.=69, ****P < 0.0001. 1, log(head width (1m)); 
two-tailed unequal variance t-test, t= 6.69, d.f. = 68,****P < 0.0001. 

m, Absence of rudimentary wing disc growth in minor worker larvae 
exposed to acetone + 100% soldiers. n, Plot of head-to-body scaling 
between acetone + 100% minor workers (n= 114) and acetone + 100% 
soldiers (n = 88) treatments of individuals that developed into the minor- 
worker size distribution. ANCOVA: slope, F = 2.84, d.f. = 198, P=0.0937; 
y intercept, F= 20.77, d.f.=199, P< 0.0001. 0, p, Comparing between 
acetone + 100% minor workers (n = 114) and acetone + 100% soldiers 
(n= 88) treatments of individuals that developed into the minor-worker 
size distribution. 0, log(body length (1m)); two-tailed Mann-Whitney 
U-test, U= 3178, ****P < 0.0001. p, log(head width (11m)); two-tailed 
unpaired t-test, t= 0.28, d.f. = 200, P= 0.7823. Bonferroni correction was 
applied to comparison of log(head width) and log(body length). The box 
plots in c, d, g, h, k, 1, 0, p show mean (+), interquartile range (bars) and 
minimum-to-maximum values (whiskers); all points represent individual 
ants. Plots in b, f, j, n show linear regressions in which the x axis is 
log(body length (\1m)) and the y axis is log(head width (j1m)). All images 
are to scale. Experiments were repeated at least three times. 
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Extended Data Table 1 | Description and references for presence or absence of soldier subcaste and discrete variation in rudimentary wing 
disc size across 21 ant species 


Presence of a Inter-subcaste 


3 soldier (major variation in Description of variation in 
Species Reference "i F . 7 bn as Reference 
worker) rudimentary wing rudimentary wing disc size 
subcaste disc size? 
Wilson® MW= no visible discs 
Pheidole rhea Yes Rajakumar et al.”°, Yes SD = 2 pairs of discs Rajakumar et al.”° 
Huang & Wheeler” XSD = 2 pairs of large discs 

: F 8 MW. no visible discs Rajakumar et al.”*, 
Pheidole megacephala Yes Mien Yes SD = 1 pair of forewing discs Sameshima et al.’* 

: . Wilson®, Huang & MW= no visible discs F 10 
Pheidole spadonia Yes Wheeler®” Yes Si 1 aatratformwine des Rajakumar et al. 

. - ft SB MW= no visible discs F 10 
Pheidole pilifera Yes Wilson Yes S04 aelrat forewing ise Rajakumar et al. 
Pheidole tysoni Yes Wilson® Yes baie MEIBIG a Rajakumar et al.”° 

SD = 1 pair of forewing discs 
f 8 - ae 5 
Pheidole bicarinata Yes vellsan 7 ier aaa 7 1s eae ee assis: Wheeler & Nijhout” 
Nijhout SD = 1 pair of forewing discs 
Pheidole moerens Yes Wilson® Yes MWe ne visible discs, Rajakumar et al.”° 
SD = 1 pair of forewing discs 
Wilson’, MW= no visible discs Ratakumarétal22 
Pheidole obtusospinosa Yes Rajakumar et ale’, Yes SD = 1 pairs of forewing discs nen ‘iid _ 
Huang & Wheeler” XSD = 2 pairs of large discs Y 
ae z Abouheif & Wray’, 
Pheidole morrisi Yes Wilson® Yes sp heat sel sie m Rajakumar et al.’°, 
eee 8 Shbailat & Abouheif™ 

, i, Wilson® MW. no visible discs Rajakumar et al.*°, 
Ppeiaole nyartt Nes aes SD = 1 pair of forewing discs Present study 
Pheidole vallicola Yes Wilson® Yes shanti Mipble bagel Rajakumar et al.° 

SD = 1 pair of forewing discs 
MW= isible di 
Pheidole dentata Yes Wilson® Yes eee cists Rajakumar et al.”° 
SD = 1 pair of forewing discs 
. 59 
Tetramorium caespitum No a oa 7 No W = 2 pairs of small wing discs Shbailat & Abouheif® 
. 61 _ ‘ , Abouheif & Wray’, 
Crematogaster cerasi No Morgan & Mackay No W = 2 pairs of small wing pads Shbailat & Abouheif® 
Monomorium trageri No Dubois” No W =no visible discs Present study 
; 2 Tschinkel”°, MW = 2 pairs of small wing pads 

Solenopsis geminata Yes Wilson Yes SD = 2 pairs of large wing discs Present study 
Myrmica americana No Weber™ No W =2 pairs of small wing discs Abouheif & Wray’ 
Lasius niger No Wilson® No W = 2 pairs of small wing discs Shbailat & Abouheif® 
Formica pallidefulva 

described as Neoformica No Trager, Maccown = No W = 2 pairs of small wing discs Abouheif & Wray’ 

Trager 
nitidiventris in’) 8 
MW = 2 pairs of Il wi d 
Camponotus floridanus Yes Alvarado”’, Wilson? Yes rae aie we bas i Present study 
SD = 2 pairs of large wing discs 
Mysterium obertheuri No Béhague et al.”” No W = 2 pairs of wing discs Béhague et al.” 


MW, minor worker; SD, soldier; W, worker; XSD, supersoldier. Refer to Extended Data Fig. 1a for the phylogeny. Data were obtained from previous publications 
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Clearance of senescent glial cells prevents 
tau-dependent pathology and cognitive decline 


Tyler J. Bussian!?, Asef Aziz**, Charlton F. Meyer’, Barbara L. Swenson?, Jan M. van Deursen!? & Darren J. Baker!?* 


Cellular senescence, which is characterized by an irreversible cell- 
cycle arrest! accompanied by a distinctive secretory phenotype’, 
can be induced through various intracellular and extracellular 
factors. Senescent cells that express the cell cycle inhibitory protein 
p16!\**4 have been found to actively drive naturally occurring 
age-related tissue deterioration** and contribute to several 
diseases associated with ageing, including atherosclerosis’ and 
osteoarthritis®. Various markers of senescence have been observed 
in patients with neurodegenerative diseases’~°; however, a role for 
senescent cells in the aetiology of these pathologies is unknown. 
Here we show a causal link between the accumulation of senescent 
cells and cognition-associated neuronal loss. We found that the 


MAPT"?°!SPS19 mouse model of tau-dependent neurodegenerative 
disease! accumulates p16'NX*4_positive senescent astrocytes 
and microglia. Clearance of these cells as they arise using INK- 
ATTAC transgenic mice prevents gliosis, hyperphosphorylation 
of both soluble and insoluble tau leading to neurofibrillary tangle 
deposition, and degeneration of cortical and hippocampal neurons, 
thus preserving cognitive function. Pharmacological intervention 
with a first-generation senolytic modulates tau aggregation. 
Collectively, these results show that senescent cells have a role in 
the initiation and progression of tau-mediated disease, and suggest 
that targeting senescent cells may provide a therapeutic avenue for 
the treatment of these pathologies. 


Fig. 1 | Senescent astrocytes and 

microglia that accumulate in the brains 

of MAPT?25!SPS19 mice can be removed 
using the INK-ATTAC transgene. a, RT- 
qPCR analysis of p16'"** expression in the 
hippocampus (left) and cortex (right) of wild- 
type (WT) and MAPT?2"5P§19 mice. The 
number of mice for each column are indicated 
in parentheses in the left-hand graph, 2 


* 
mo 


independent experiments; normalized to 

the three-month wild-type group. b, Study 
design for the clearance of senescent cells in 
PS19;ATTAC mice. Veh., vehicle. c, RT-qPCR 


analysis of the expression of senescence 


markers in the hippocampus (left) and cortex 


(cortex) of six-month-old male mice, treated 


with either vehicle (-AP) or AP20187 (+AP). 
n=5 mice per group; normalized to the 
ATTAC -AP group. d, Electron micrograph 
showing an X-Gal-positive astrocyte (left) 
and microglia (right) after SA-6-Gal staining, 
ie from a six-month-old vehicle-treated 
PS19;ATTAC male mouse. e, Quantification 
of cells containing X-Gal crystals in the 
hippocampus (left) or the cortex (right) of 
six-month-old male mice. n = 3 male mice per 
group, 2 independent experiments; the bars 


8 months 


are coloured as in c. Scale bars, 1 {1m (d) and 
200 nm (d, insets). Data are mean +s.e.m. 
*P< 0.05; **P< 0.01; ***P < 0.001 (unpaired 
two-sided t-tests with Welch’s correction (a) 
and one-way ANOVA with Tukey’s multiple 
comparisons test (c, e)). Exact P values can be 
found in the accompanying Source Data. 
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Fig. 2 | Senescent cells promote insoluble tau aggregates. a, Representative 
western-blot analysis of the whole brain of six-month-old mice for soluble 
tau (top), soluble phosphorylated tau (S202/T205; middle) and insoluble 
phosphorylated tau (S202/T205; bottom). >3 independent experiments. 
b, Immunostaining and quantification of cortex sections from six-month- 
old mice for phosphorylated tau (S202/T205) protein aggregates. n = 6 
mice per group, 3 independent experiments. c, Thioflavin-S staining 


Senescent cells accumulate with ageing and have been shown to con- 
tribute to tissue dysfunction", although their role in neurodegenerative 
disease is still unknown. To address this key open question, we selected 
the transgenic mouse line MAPT?9lSPS19 (hereafter PS19), which— 
under the regulation of the mouse prion promoter—expresses high 
levels of mutant human tau specifically in neurons!°. The model is char- 
acterized by gliosis, neurofibrillary tangle (NFT) deposition, neurode- 
generation and loss of cognitive function. Pathology typically initiates 
in the hippocampus and radiates outwards to the neocortex’”. First, we 
performed quantitative PCR with reverse transcription (RT-qPCR) for 
p16'"*# (also known as Cdkn2a) on isolated hippocampi and cortices 
from wild-type and PS19 littermates. A significant increase in p16!" 
expression was seen in the PS19 mice beginning at four months of age 
in the hippocampus and at six months of age in the cortex (Fig. 1a), 
which precedes the onset of NFT deposition!®. Importantly, increased 
expression of p16!"** correlated with the expression of widely estab- 
lished senescence markers (Extended Data Fig. 1), indicating that 
senescent cells accumulate at sites of pathology in the PS19 model. 

To investigate the role of senescent cells in the development of 
disease, we crossed the INK-ATTAC transgene (hereafter ATTAC) to the 
PS19 strain to eliminate p16'"*“*-expressing senescent cells through the 
twice-weekly administration of AP20187 (hereafter AP)** from weaning 
age (Fig. 1b). Hippocampi and cortices isolated from six-month-old 
vehicle-administered PS19;ATTAC mice displayed an increased level of 
the ATTAC transgene, as measured by the expression of Casp8 and Gfp 
(Fig. 1c, Extended Data Fig. 2). Levels of senescence indicators, includ- 
ing the cell-cycle regulators p16", p1947 (also known as Cdkn2a) and 
p21°P!/W4l! (also known as Cdkn1a) and the pro-inflammatory genes 
Pail (also known as Serpine1), Il6 and I11b, were also increased (Fig. 1c, 
Extended Data Fig. 2). Administration of AP to PS19;ATTAC mice 


IB PS19;ATTAC - AP 
IB PS19;ATTAC + AP 


PS19;ATTAC + AP 


kek kk 


Tangles per ROI (x104) 
Le) 
Oo 


and quantification of neurofibrillary tangles located within the dentate 
gyrus of eight-month-old mice. n = 6 mice per group, 2 independent 
experiments; normalized to the ATTAC + AP group. The bar graph is 
coloured as in b. ROI, region of interest. Scale bars, 100 1m (b) and 504m 
(c). Data are mean + s.e.m. **P < 0.01; ***P < 0.001 (one-way ANOVA 
with Tukey’s multiple comparisons test). Exact P values can be found in the 
accompanying Source Data. For gel source data, see Supplementary Fig. 1. 


maintained the expression of these genes at a level comparable to that 
of control mice (Fig. 1c, Extended Data Fig. 2). Importantly, treatment 
of ATTAC mice that lacked the PS19 transgene with AP had no effect 
on the expression of these markers (Extended Data Fig. 2). Thus, AP 
administration effectively and selectively cleared senescent cells in the 
hippocampus and cortex of PS19;ATTAC mice. 

To understand the mechanistic contribution of senescence to 
tau-mediated pathology, we sought to identify the specific cell 
types that were becoming senescent. First, we stained cortices and 
hippocampi from six-month-old vehicle-treated ATTAC and 
PS19;ATTAC mice and AP-treated PS19;ATTAC mice for 
senescence-associated-8-galactosidase (SA-3-Gal)!” and screened for 
cells that contained 5-bromo-4-chloro-3-indolyl-6-p-galactopyranoside 
(X-Gal) crystals by transmission electron microscopy!?. We found that 
cells that clearly and morphologically resembled astrocytes or micro- 
glia contained X-Gal crystals, irrespective of the treatment group from 
which they arose (Fig. 1d). By contrast, no crystals were found in any 
clearly identifiable neurons (Extended Data Fig. 3). Compared with 
control mice, vehicle-treated PS19;ATTAC mice had nearly double the 
number of cells containing X-Gal crystals in both the hippocampus 
and the cortex (Fig. le), whereas AP-treated PS19;ATTAC mice had a 
similar incidence of X-Gal crystals as control mice (Fig. le). To validate 
that senescence was affecting astrocytes and microglia, we performed 
fluorescence-activated cell sorting (FACS) on six-month-old wild-type 
and PS19 mice (Extended Data Fig. 4a). Isolated astrocytes and 
microglia had increased expression of senescence-associated genes, 
including p16'"*#* (Extended Data Fig. 4b, c). A similar induction 
was not observed in oligodendrocytes or neuron-enriched CD56~* 
cells (Extended Data Figs. 4d, e, 5), supporting the conclusion that 
senescence occurs primarily in astrocytes and microglia of PS19 mice. 
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Fig. 3 | Senescent cells drive neurodegenerative disease. a, Sagittal 
midline brain area images (left) and quantification (right) of 
eight-month-old mice. =5 males per group, 2 independent experiments. 
b, Nissl stains of the dentate gyrus (left) and quantification (right) from 
eight-month-old mice. c, Average area of the dentate gyrus (measuring the 
pyramidal neuron layer) from serial, coronal NeuN-stained free-floating 
sections. n= 6 ATTAC -AP and n=7 PS19;ATTAC — AP and PS19;ATTAC 
+ AP mice, 2 independent experiments. d, Setup of the novel-object 


To verify that the administration of AP selectively targeted senescent 
cells, we prepared in vitro cultures of primary microglia and astro- 
cytes isolated from ATTAC mice. These cultures were not sensitive to 
AP-mediated elimination in the absence of senescence-inducing stimuli 
(Extended Data Fig. 6). Furthermore, for ATTAC transgenic mice in 
vivo, the short-term administration of AP did not promote excessive 
cellular death (Extended Data Fig. 7a), and extended treatment with 
AP did not result in increased proliferation of microglia (Extended 
Data Fig. 7b). 

PS19 mice present progressive gliosis with disease progression)”. 
To assess whether the administration of AP affected this process, 
RT-qPCR was performed on the hippocampi of six-month-old mice 
for markers of astrocytes (Gfap and S100b) and microglia (Cd11b, 
also known as Itgam). Vehicle-treated PS19;ATTAC mice had an 
approximately two- to threefold induction in these markers, whereas 
AP-treated PS19;ATTAC mice expressed these markers at a similar level 
to control mice (Extended Data Fig. 8a, b). Immunohistochemistry 
staining for GFAP and IBA] confirmed these observations (Extended 
Data Fig. 8c, d). Taken together, these results suggest that both senes- 
cent glial cells and gliosis are eliminated upon the administration of 
AP in PS19;ATTAC mice. 

A distinguishing characteristic of PS19 mice is the development of 
aggregates consisting of hyperphosphorylated tau protein by six months 
of age’®. To assess whether tau aggregation was affected by senescence 
clearance, we measured the levels of soluble total and phosphorylated 
tau (S202/T205) in addition to the level of insoluble phosphorylated 
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recognition experiment (left) and average ratio for the number of 
investigations (middle) and duration of those investigations (right). n =8 
female mice per group. Scale bars, 0.5 cm (a) and 100m (b). Data are 
mean +s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (one-way ANOVA 
with Tukey’s multiple comparisons test (a, c) and two-way ANOVA with 
Tukey’s multiple comparisons test (d)). Exact P values can be found in the 
accompanying Source Data. 


tau in vehicle-treated PS19;ATTAC and AP-treated ATTAC and 
PS19;ATTAC mice. As expected, vehicle-treated PS19;ATTAC mice 
displayed increased levels both of soluble total and phosphorylated 
tau and of insoluble phosphorylated tau (Fig. 2a, Extended Data 
Fig. 9a, b). AP-treated PS19;ATTAC mice had identical levels of soluble 
total tau to vehicle-treated PS19;ATTAC mice (Fig. 2a), indicating that 
overexpression of tau from the transgene was maintained. Treatment 
of PS19;ATTAC mice with AP significantly reduced the amount of 
phosphorylated tau in both the soluble and the insoluble fractions 
(Fig. 2a, Extended Data Fig. 9b). Immunohistochemistry staining for 
phospho-tau modifications at $202/T205, T231 and $396 confirmed 
that the clearance of senescent cells attenuated tau phosphorylation at 
several residues that are relevant for tau aggregation (Fig. 2b, Extended 
Data Fig. 9c). Additionally, the staining of brain sections of eight- 
month-old mice from these same groups with thioflavin-S revealed 
that NFT deposition in the dentate gyrus—the site of neurogenesis 
in the hippocampus that is traditionally associated with memory 
formation and cognition'*—was substantially reduced when senescent 
cells were removed (Fig. 2c). Collectively, these results indicate that the 
accumulation of senescent cells promotes the formation of hyperphos- 
phorylated tau aggregates. 

PS19 mice show neurodegeneration by eight months of age!®. As 
NEFT deposition was attenuated upon treatment with AP in both the 
cortex and the hippocampus of PS19;ATTAC mice, we performed 
assessments for degeneration in these areas. The overt brain size of 
vehicle-treated PS19;ATTAC mice was reduced compared to both 
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Fig. 4 | ABT263 can modulate senescent cells and attenuate tau 
phosphorylation. a, RT-qPCR analysis of the expression of senescence 
markers from the hippocampus (left) and the cortex (right) of six-month- 
old mice treated with either vehicle (-ABT263) or ABT263 (+ ABT263). 
Number of mice is as indicated; normalized to the WT - ABT263 group. 
b, Representative immunostaining of cortex (top) and hippocampal 


ATTAC and AP-treated PS19;ATTAC mice (Fig. 3a). In addition, we 
observed localized neurodegeneration in the dentate gyrus of the hip- 
pocampus through Nissl staining in vehicle-treated PS19;ATTAC mice 
(Fig. 3b). The administration of AP prevented thinning of the dentate 
gyrus and increased neuron density. Sequential coronal sectioning and 
NeuN staining revealed that the dentate gyrus was significantly reduced 
in area in vehicle-treated PS19;ATTAC mice (Fig. 3c), further demon- 
strating that senescent cells promote neurodegeneration in PS19 mice. 

To test whether the effects observed upon administration of AP 
resulted in improved cognitive function, we performed novel-scent 
discrimination assessments to test for changes in short-term memory”® 
(for experimental setup, see Fig. 3d). Whereas AP-treated ATTAC mice 
were more inquisitive towards the novel scent during the testing phase, 
vehicle-treated PS19;ATTAC mice were not (Fig. 3d). By contrast, 
AP-treated PS19;ATTAC mice behaved nearly identically to control 
(AP-treated ATTAC) mice, indicating that the elimination of senescent 
cells mitigated the short-term memory loss observed in vehicle-treated 
PS19;ATTAC mice. Notably, the overall distance travelled by mice in 
all groups was unchanged (data not shown), and similar results were 
obtained in novel-object discrimination tests that had the same setup 
but used visual cues instead of scents (Extended Data Fig. 10). Thus, 
these results demonstrate that senescent cells drive neurodegeneration 
and loss of cognition in PS19 mice. 

Last, we tested whether the pharmacological elimination of senes- 
cent cells with the senolytic ABT263 (navitoclax)>°!©"7 resulted in 
similar effects to our genetic interventions in PS19 mice. Recent work 
has demonstrated a therapeutic effect in orthotopically implanted glio- 
blastomas of the peripheral administration of ABT263!*. Wild-type 
and PS19 mice were treated with a repeating schedule of ABT263, 
beginning at weaning age and continuing until the mice reached six 


(bottom) sections for phosphorylated tau (S202/T205) protein aggregates. 
n=4 mice per group, 2 independent experiments. Scale bar, 100 1m. Data 
are mean +s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (one-way ANOVA 
with Tukey’s multiple comparisons test). Exact P values can be found in 
the accompanying Source Data. 


months of age. Notably, this treatment prevented the upregulation of 
senescence-associated genes (Fig. 4a) and attenuated tau phosphoryl- 
ation in PS19 mice (Fig. 4b), indicating that senolytic interventions 
can recapitulate key observations from transgenic mouse models of 
senescent-cell ablation. 

The mechanistic contribution of cells with features reminiscent of 
senescence to the pathophysiology of neurodegenerative diseases has 
been a common question in recent years’ 19-21, Furthermore, recent 
work has suggested that senescent cells may contribute to the pathology 
of Parkinson's disease in both mice and humans”. Here we show that 
the continuous clearance of p16'"*#*-expressing senescent cells before 
disease onset in a model of aggressive tauopathy has a marked effect 
on various aspects of disease progression, including gliosis, NFT for- 
mation, neurodegeneration and cognitive decline. Senescent-cell clear- 
ance also has a notable effect on the accumulation of phosphorylated 
tau protein in both the soluble and insoluble fractions. The amount 
of total soluble tau was unchanged in AP-treated PS19;ATTAC mice 
(Fig. 2a), indicating that the aberrant hyperphosphorylation of tau pro- 
tein and subsequent aggregation into NFTs is mediated by extracellular 
signalling from p16""““"-expressing senescent glial cells. The molecular 
mechanisms that senescent astrocytes and microglia exploit to promote 
the pathological conversion of tau into NFTs within neurons require 
additional investigation. The absence of neurodegeneration in mice 
treated with AP (Fig. 3) demonstrates that the attenuation of disease 
severity does not result from the clearance of neurons containing 
NFTs. However, it is important to leave open the possibility that other 
models of neurodegenerative disease may exhibit senescence-associated 
alterations in cell types other than those observed in the present study. 
Regardless, based on our observations, it is likely that intervening with 
senescent-cell accumulation in these models would also reduce the 
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severity of disease. As this study was designed to prevent senescent 
cells from accumulating to determine how this affects disease, future 
studies of senolysis in established disease models will be necessary to 
determine whether senolytic strategies could translate into the clinic 
to halt or perhaps revert disease. As senescent cells exhibit a unique 
and identifiable senescence-associated secretory phenotype, exploiting 
this phenotype may serve as a possible therapeutic avenue to attenuate 
many tau-dependent pathologies. Our observation that p16!" expres- 
sion increases before the onset of NFT aggregation further supports the 
now commonly held belief that early intervention in these diseases is 
essential to provide more beneficial effects for patients. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
https://doi.org/10.1038/s41586-018-0543-y. 
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METHODS 

Mouse strains and drug treatment. MAPT?*°/SPS19 (PS19) mice were purchased 
from The Jackson Laboratory (stock no. 008169) and bred to C57BL/6 for three 
generations. C57BL/6 ATTAC transgenic mice are as previously described*". 
Male PS19 mice were bred to ATTAC females to generate cohorts of ATTAC and 
PS19;ATTAC mice. All mice were on a pure C57BL/6 genetic background. Mice 
from this cohort were randomly assigned to receive AP20187 (AP; B/B homod- 
imerizer; Clontech) or vehicle twice a week beginning at weaning age (3 weeks). 
Dosing of AP was 2.0 mg per kg body weight. Six-month-old short-term AP-pulse- 
treated mice (Extended Data Fig. 6a) received a dose of 10 mg per kg body weight 
for five consecutive days before tissue collection. Senolytic intervention was per- 
formed in C57BL/6 wild-type and PS19 mice. At weaning, mice were assigned to 
receive either ABT263 (Cayman, 923564-51-6) or vehicle (Phosal 50 PG, Lipoid 
NC0130871, 60%; PEG400, Sigma 91893, 30%; ethanol, 10%). ABT263 was admin- 
istered by oral gavage at a dose of 50 mg per kg body weight on a repeating regimen 
of five consecutive days of treatment followed by 16 days of rest. Mice were housed 
ina 12 h:12h light:dark cycle environment in pathogen-free barrier conditions as 
previously described in detail’. Compliance with relevant ethical regulations and 
all animal procedures were reviewed and approved by the Mayo Clinic Institutional 
Animal Care and Use Committee. 

Statistical analysis. Prism software was used for all statistical analysis. A Student's 
two-tailed unpaired t-test with Welch's correction was used in Fig. la and Extended 
Data Fig. 4b-e; two-way ANOVA with Tukey’s multiple comparisons test was used 
for Fig. 3d and Extended Data Fig. 10; and one-way ANOVA with Tukey’s multiple 
comparisons test was used in all other figures. For consistency in these compar- 
isons, the following denotes significance in all figures: *P < 0.05, **P<0.01, 
***P <0.001. We note that no power calculations were used. Sample sizes are 
based on previously published experiments in which differences were observed. 
No samples were excluded. Investigators were blinded to allocation during exper- 
iments and outcome assessment, except for rare instances in which blinding was 
not possible. 

Senescence-associated (3-galactosidase transmission electron microscopy 
(Gal-TEM). Detection of X-Gal crystals by TEM after senescence-associated 
-galactosidase (SA-3-Gal) staining was performed as previously described?, 
with the following alterations to accommodate central nervous tissue. Mice were 
transcardially perfused with ice-cold Dulbecco's phosphate-buffered saline (DPBS; 
pH 7.4) until fluid run-off was clear. This was followed by perfusion with 4% para- 
formaldehyde (PFA) for 10 min at a rate of 3 ml min~!, and then ice-cold DPBS was 
perfused again for 2 min at the same rate to remove the remaining fixative. Brains 
were then isolated and the hippocampus and cortex were dissected out. A 1 mm 
x 1mm piece from the CA1 and M1 regions, respectively, was then incubated in 
SA-8-Gal staining solution (Cell Signaling) at 37°C for 6 h (hippocampus) or 18 
h (cortex). The samples were placed in Trump’s fixative overnight at 4°C before 
being processed for routine transmission electron microscopy (dehydration by 
xylene-alcohol series, osmium tetroxide staining, and Epon resin embedding). 
Images were acquired and quantified using a Jeol 1400+ electron microscope with 
an 80-kV acceleration voltage. Two grids from each tissue were produced, and 
>100 cells were scanned per grid at a magnification of 20,000 to detect cells 
containing X-Gal crystals. On average, half of all cells examined were neurons. 
Cells with one or more crystals, and the total number of cells, were counted. Cells 
containing crystals were imaged and independently assessed for distinguishing 
morphology. To define cell type, the following criteria were applied. Astrocytes: 
circular nucleus with spattered electron density pattern; microglia: abnormally 
shaped nucleus with a much darker, often phagosome-containing cytoplasm; and 
neuron: large circular nucleus with less electron density and periodically denoted 
by an offshooting axon. Only cells with morphology consistent with astrocytes or 
microglia were clearly positive for X-gal crystals. 

Western blotting for soluble and sarkosy] insoluble proteins. Half brains were 
weighed and homogenized in 5 x volume of Buffer I (50 mM Tris base (pH 7.4), 
50 mM NaCl, 1 mM EDTA, 1 mM phenylmethylsulfonyl fluoride (PMSF), 1x 
Halt Protease and Phosphatase Inhibitor Cocktail (Thermo)). Two hundred and 
fifty microlitres of the homogenate was then added to an equal volume of Buffer S 
(50 mM Tris (pH 8.0), 274 mM NaCl, 5 mM KCl, 1 mM PMSF, 1X Halt Protease 
and Phosphatase Inhibitor Cocktail (Thermo)) and ultracentrifuged at 150,000¢ 
for 15 min at 4°C. The supernatant (S1-soluble protein fraction) was transferred 
to a new tube and the pellet homogenized in 3 x volume of sucrose buffer (10 mM 
Tris (pH 7.4), 0.8M NaCl, 10% sucrose, 1 mM EGTA, 1 mM PMSF) before being 
ultracentrifuged at 150,000g for 15 min at 4°C. The pellet was discarded and the 
supernatant incubated with sarkosyl (sodium lauroyl sarcosinate) at a final con- 
centration of 1% for 1 h at 37°C. After incubation, the samples were ultracentri- 
fuged at 150,000g for 30 min at 4°C. The supernatant was discarded and the pellet 
was resuspended in 25 11 Buffer F (10 mM Tris (pH 8.0), 1 mM EDTA) to obtain 
the insoluble protein fraction (S2). Equal parts of 2x laemmli buffer (Bio-Rad) 
containing 5% (3-mercaptoethanol was added to each fraction (S1 and S2) and 
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boiled at 100°C for 15 min to prepare the sarkosyl-soluble and -insoluble protein 
lysates. For total protein lysate, 90 11 of the homogenate (half brain in 5X volume 
Buffer I) was added to 110, of Buffer T (2% SDS, 50 mM Tris (pH 7.4), 274 mM 
NaCl, 5 mM KCI, 5 mM EDTA, 1% Triton-X-100, 1 mM PMSE, X Halt Protease 
and Phosphatase Inhibitor Cocktail (Thermo)). The samples were then sonicated 
and centrifuged at 16,000g for 15 min at 4°C to remove debris. The supernatant was 
removed and added to equal parts 2 x laemmli buffer with 5% 3-mercaptoethanol 
and boiled at 100°C for 15 min to prepare the total protein lysate. Western blotting 
was performed as previously described”’. Blots were probed with antibodies for 
total tau (Thermo Fisher; MN1000, 1:5,000) and phospho-tau $202/T205 (Thermo 
Fisher; MN1020, 1:1,000). Ponceau-S staining was performed to normalize lysate 
loading for the total- and S1-fraction lysates. Quantification was performed using 
ImageJ as previously described’. 

Quantitative RT-PCR. RNA extraction, cDNA synthesis and RT-qPCR analysis 
were performed on hippocampi and cortical samples from mouse brains as previ- 
ously described”*. Primers used to amplify Casp8, GFP, p16", p194¥, p21, Pail, 
II-6, Il-1b and Cd11b were as previously described?*. The following additional 
primers were used: Gfap forward 5'/-CCTTCTGACACGGATTTGGT-3’, reverse 
5'-TAAGCTAGCCCTGGACATCG-3’; $100b forward 5’-CCGGAGTACTGG 
TGGAAGAC-3’, reverse 5’-GGACACTGAAGCCAGAGAGG-3’; Aqp4 
forward 5’-TGAGCTCCACATCAGGACAG-3’, reverse 5’-TCCAGCTCGATCTT 
TTGGAC-3’; Cx3cr1 forward 5’-GTTCCAAAGGCCACAATGTC-3’, 
reverse 5'/-TGAGTGACTGGCACTTCCTG-3’; Olig2 forward 5’-CCCCA 
GGGATGATCTAAGC-3’, reverse 5'-CAGAGCCAGGTTCTCCTCC-3’; Nefl 
forward 5'-AGGCCATCTTGACATTGAGG-3’, reverse 5'/-GCAGAATGCAGAC 
ATTAGCG-3’; Thp forward 5'-GGCCTCTCAGAAGCATCACTA-3’, reverse 
5!-GCCAAGCCCTGAGCATAA-3’. Expression for all experiments was normalized 
first to Tbp. 

Immunohistochemistry and immunofluorescence staining. Mice were transcar- 
dially perfused as described in ‘Senescence-associated 3-galactosidase transmission 
electron microscopy (Gal-TEM). Brains were stored in 4% PFA overnight at 4°C 
and then cryoprotected by incubating in a 30% sucrose solution for 48 h at 4°C. 
Samples were sectioned into 30-|1M-thick coronal sections and stored in antifreeze 
solution (300 g sucrose, 300 ml ethylene glycol, 500 ml PBS) at —20°C. Nissl staining 
(bregma —2.1 to —2.4 mm), thioflavin S staining (bregma —1.4 to —1.6 mm), 
and phospho-tau $202/S205 (Thermo Fisher, MN1020; 1:500), phospho-tau T231 
(Thermo Fisher, MN1040; 1:500), phospho-tau $396 (Abcam, 109390; 1:500), and 
GFAP (Dako, Z0334; 1:500) and IBA1 (Novus, NB100-1028; 1:100) immunohisto- 
chemistry staining (bregma 1.6 to 1.0 mm and lateral 2.0 to 2.7 mm) was performed 
on free-floating sections as described”®>-?”. NeuN staining (EMD, MAB377; 1:200) 
of five sections (between bregma — 1.3 to —2.5) to measure the dentate gyrus area 
was performed as previously described”*. For cellular proliferation assays, mice 
were injected with EdU (Carbosynth, NE08701; 75 mg kg~') intraperitoneally 24 h 
before euthanasia. Imaging of EdU-positive cells (lateral 0.75 to 1.25 mm) was 
performed following the manufacturer's instructions (Invitrogen Click-iT EdU 
Alexa Fluor 488 Imaging Kit, C10337). IBA1 (Wako, 019-19741; 1:500) immuno- 
fluorescent staining and IBA1/EdU colocalization assessments were performed as 
previously described”*. TUNEL staining (lateral 0.75 to 1.25 mm) was performed 
according to the manufacturer's instructions (Roche In situ Cell Death Detection 
Kit, Fluorescein: 11684795910). Thioflavin S, EdU/IBA1 colocalization, and in 
vivo TUNEL-stained images were acquired on a Zeiss LSM 780 confocal system 
using multi-track configuration. 

Single-cell preparation and FACS. Dissociation of cerebral tissue was performed 
using the Adult Brain Dissociation kit from Miltenyi (MACS, 130-107-677), 
according to the manufacturer’s instructions. Samples were then incubated 
with a viability dye, LIVE/DEAD Aqua (Invitrogen, L34966; 1:250) followed 
by incubation with CD11b eFluor 450 (eBioscience, 48-0112-80, 1:100), CD45 
APC eFluor 780 (eBioscience, 47-0451-82; 1:200), Glast1 PE (Miltenyi Biotec, 
130-095-821; 1:100), O1 AF 700 (R&D Systems, FAB1327N-100UG; 1:100) and 
CD56 APC (R&D Systems, FAB7820A; 1:100). These samples were then sorted 
using a FACSAria Ilu SORP (BD Biosciences), with gating parameters created using 
FACSDiva 8.0.1 (BD Biosciences). A precise gating strategy was used to maximize 
the purification of each isolated cell population. In brief, populations were isolated 
first by a negative report of the viability dye indicating the cell is viable (Extended 
Data Fig. 4), followed by a positive report of the desired marker, then negative 
reports of the other labels used. This strategy allowed for live cells containing only 
the desired marker to be sorted, while eliminating dead cells. Cells were sorted 
directly into lysis buffer and RNA was extracted with RNeasy Micro kits according 
to the manufacturer's instructions (Qiagen, cat no. 74004). cDNA synthesis and 
RT-qPCR analysis were performed as described above. 

Novel-object recognition. Novel-object recognition testing was performed as 
previously described">. In brief, mice from each cohort were acclimatized to a 
50cm x 50 cm testing environment for a period of two minutes. After acclimatization, 
the mice were removed, the testing area was cleaned with 70% ethanol, and two 
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identical scented candles were placed in either corner of the testing area approx- 
imately 5 cm from either wall. Mice were reintroduced, and the ratio of both the 
number of visits and the time spent at each candle was recorded for a period of 
ten minutes. Recording was performed from above (Panasonic WV-CP294) and 
all video files were analysed with TopScan Version 3.00 (Clever Sys). Afterwards, 
the mice were removed, the testing area cleaned with 70% ethanol, and one candle 
was replaced with a novel scent. The mice were reintroduced and the number of 
visits and total time per candle was recorded as before. Testing was also performed 
with visual stimuli, by placing identical toy brick towers at either corner and then 
replacing with a different toy brick tower in the testing phase, using the same 
experimental paradigm monitoring for the number of investigations. 

In vitro astrocyte and microglia culture. Astrocyte and microglia primary 
cultures were prepared in tandem from mixed glial cultures as previously 
described”. C57BL/6 wild-type and ATTAC pups (postnatal day (P)0-P3) were 
euthanized, and the cerebellum was discarded. Meninges were removed from the 
remaning tissue using forceps and a dissection scope. Cleaned cerebral tissue was 
placed in chilled Earle’s Balanced Salt Solution with HEPES (EBSH) (NaCl 120 
mM, NaH2PO, 10 mM, KCl] 2.5 mM, CeH120¢ 20 mM, HEPES 20 mM, NaHCO; 
10 mM, BSA 0.3%, H20) until the remaining mice were euthanized, and then mice 
were pooled together on the basis of genotype (3-4 brains per group). The tissue 
was minced using a razor blade and dissociated by shaking in a 0.025% Trypsin/ 
EBSH solution at 37°C for 30 min. FBS and MgSO, (3.82%)/ DNase I (1 mg ml!) 
were added, and the sample was placed on ice for 5 min to halt trypsinization. 
Samples were mixed and centrifuged at 200g at 25°C for 5 min. The supernatant 
was discarded, and the remaining pellet was resuspended in EBSH. Tissue was trit- 
urated using a 1 ml pipette to completely dissociate the sample and allowed to settle 
for 5 min to remove large debris. Samples were then transferred to clean tubes and 
underlaid with a 4% BSA/EBSH solution. The tissue was then centrifuged at 100g at 
25°C for 8 min. Cells were counted using trypan blue and a haemocytometer and 
plated on a poly-p-lysine-coated T75 dish (7-10 million cells per flask) with glial 
cell culture medium (GCM) consisting of DMEM with 10% FBS, sodium pyruvate 
(1 mM), Pen/Strep (500 1g m7), and InvivoGen Primocin (100 bg ml”). Cultures 
were grown for 14 days (37°C, ambient O2) with medium changes every 4 days. 
Microglia were isolated as previously described™ using the EasySep Mouse CD11b 
Positive Selection Kit from Stem Cell (cat. no. 18970). Microglia were collected and 
plated on 10-well glass slides (5,000 cells per well) and cultured for 6 days in GCM 
with LADMAC-conditioned medium (20%, provided by the Howe laboratory) 
before further experimentation. This conditioned medium aids in the proliferation 
and maintenance of microglia cultures through the secretion of macrophage colo- 
ny-stimulating factor by the LADMAC cells*!. Microglia were allowed to proliferate 
for 6 days before experimentation. The mixed glial culture flow-through from 
the EasySep CD11b kit was replated in GCM on a poly-p-lysine-coated T75 dish 
(10 million cells per flask). These cultures then underwent purification for astrocytes 
as previously described”. After 48 h, flasks were placed on an orbital shaker and 
agitated at 200 r.p.m. for two 24-h periods with medium refreshed once during 
and after the shaking. Flasks were then exposed to GCM containing liposomal 
clodronate (Clodrosome, 8909; 100 jg ml~') for 72 h to remove any remaining 
microglia from the culture. The liposomal clodronate medium was then removed 
and culture plates washed before further experimentation. 

Microglia activation and TUNEL staining. Microglia samples were exposed to 
medium containing IFN (R&D Systems, 285-IF; 200 ng ml”), lipopolysaccha- 
rides (LPS) (Sigma, L2654; 100 ng ml“) or a combination of both for a period of 
24h to induce an inflammatory response”. Cells were then processed for immu- 
nofluorescence to determine inflammation state as previously described*’. Anti- 
CD11b antibody (Bio-Rad, MCA711G; 1:500) and goat anti-rat AlexaFluor 594 
(Invitrogen, A-11007; 1:500) staining was counterstained with DAPI (Invitrogen, 
D1306; 1:1,000). To assess AP-mediated cell clearance specificity, activated or 
basal microglia were exposed to AP20187 (Clontech, 635059; 10 nM or 100 nM) 
for a period of 24h. TUNEL staining was then performed according to the 


manufacturer’s instructions (Roche In situ Cell Death Detection Kit, Fluorescein: 
11684795910). All imaging was performed using an Olympus BX53 Fluorescence 
microscope and DP80 digital camera. Analysis was performed using the Fiji 
distribution of Image] (version 1.51n)*4. To obtain a TUNEL positive percentage, 
a region of interest was defined using Li Auto Thresholding of the DAPI channel, 
and the colocalization percentage was calculated using the colocalization threshold 
plugin bound by that region. 

IncuCyte tracking of basal and activated astrocytes. To track basal and activated 
astrocytic response to AP, astrocytes were plated in a 48-well culture plate (10,000 cells 
per well) and placed into the IncuCyte S3 Live-Cell Analysis System. The IncuCyte 
System is a time-lapse imaging system that records changes in cell culture through 
photographic capture of the culture well within the incubator. Cultures were 
acclimatized to the system for a period of 6 h, then exposed to medium containing 
IFN (R&D Systems, 285-IF; 200 ng ml~'), LPS (Sigma, L2654; 100 ng ml~!) or 
a combination of both for a period of 24h to induce an inflammatory response”. 
Cells were also plated on 10-well slides and processed in tandem for immunofluo- 
rescence staining to verify activation status with anti-GFAP (DAKO, Z0334; 1:500) 
and counterstained with DAPI (Invitrogen, D1306; 1:1,000). After activation, astro- 
cytes were exposed to AP20187 (Clontech, 635059; 10 nM or 100 nM) fora period 
of 24h. The IncuCyte-captured phase images of each culture well were taken every 
30 min over this period using the following settings: segmentation adjustment: 
0.8, hole fill: 450, adjust size (pixels): —1, Minimum area (|1m7): 0.1. The phase 
confluency difference was calculated by subtracting the final phase confluency of 
each image from its initial value. 

Reporting summary. Further information on research design is available in the 
Nature Research Reporting Summary linked to this paper. 


Data availability 
All source data and exact P values (if applicable) for every figure are included in 
the supporting information that accompanies the paper. 
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Extended Data Fig. 1 | Senescent cells accumulate in PS19 mice. month wild-type group. Data are mean + s.e.m. *P < 0.05; **P < 0.01; 
RT-qPCR analysis of senescence-associated genes in hippocampi (left)and — ***P < 0.001 (one-way ANOVA with Tukey’s multiple comparisons test). 
cortices (right) of three- and ten-month-old male mice. Number of mice Exact P values can be found in the accompanying Source Data. 
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Extended Data Fig. 2 | AP-mediated clearance selectively removes 
senescent cells that accumulate in the brains of PS19;ATTAC mice. 
RT-qPCR analysis of the expression of senescence markers in the 
hippocampus (left) and cortex (right) of six-month-old female mice, 
treated with either vehicle (-AP) or AP20187 (+-AP). Number of 
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mice is as indicated; normalized to the ATTAC - AP group. Data are 

mean +s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (one-way ANOVA with 
Tukey’s multiple comparisons test). Exact P values can be found in the 
accompanying Source Data. 
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Neurons 


Extended Data Fig. 3 | Neurons do not exhibit X-Gal crystals upon 
Gal-TEM imaging. Representative electron microscopy image of 
neurons after SA-$-Gal staining from a six-month-old, vehicle-treated 
PS19;ATTAC male mouse (n =3 male mice, 2 independent experiments). 
The image has been artificially coloured to show individual cell bodies. 
Scale bar, 10 um. 
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Extended Data Fig. 4 | Increased expression of senescence- 
associated genes is observed in astrocytes and microglia isolated 
from PS19 mice. a-e, Gating strategy (a) for FACS isolation of living 
astrocytes (b), microglia (c), oligodendrocytes (d) and neuron- 
enriched CD56* cells (e) from cortices of six-month-old wild-type 
and PS19 mice. b, Astrocyte (CD11b” CD45~017- GLAST* CD56") 
fraction (left) and RT-qPCR analysis (right). c, Microglia 
(CD11b*CD45+tO1GLAST*CD56_ ) fraction (left) and RT-qPCR 
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analysis (right). d, Oligodendrocyte (CD11b~ CD45~O1*GLAST- CD56") 
fraction (left) and RT-qPCR analysis (right). e, Neuron-enriched CD56 
(CD11b~-CD45~ O01 GLAST CD56*) fraction (left) and RT-qPCR 
analysis (right). Individual numbers of independent mouse cell population 
isolations are indicated in the parentheses above p16’"*** columns (2 
independent experiments). Data are mean 4 
(unpaired two-sided t-tests with Welch’s correction). Exact P values can be 
found in the accompanying Source Data. 
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Extended Data Fig. 5 | Verification of the identity of cell populations 
isolated by FACS. a-d, RT-qPCR analysis of cell-identity markers from 
cell populations isolated from six-month-old wild-type and PS19 mice: 
Aqp4 expression enriched in astrocytes (a), Cx3cr1 expression enriched in 
microglia (b), Olig2 expression enriched in oligodendrocytes (c) and Nefl 
expression enriched in neurons (d). Expression is normalized to intact 
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cortices of six-month-old wild-type mice (n = 4 biologically independent 
cell isolations for each group, 2 independent experiments). Data are 
mean + s.e.m. ***P< <0.001 (one-way ANOVA with Tukey’s multiple 
comparisons test). Exact P values can be found in the accompanying 
Source Data. 
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Extended Data Fig. 6 | AP administration does not erroneously astrocytes treated with IFNy, LPS or a combination of both as described in 


eliminate non-senescent glial cells isolated from ATTAC mice. a,CD11b a(n =3 biologically independent samples). d, Quantification of the change 


staining of primary microglia treated with IFNy (200 ng ml~!), LPS in confluency over 24 


h in basal or activated astrocytes (n = 4 biologically 


(100 ng ml“) or a combination of both (n= 3 biologically independent independent cultures of each genotype and treatment). Scale bars, 100 1m 


samples). b, Quantification of TUNEL-positive bodies in basal or activated (a, c). Data are mean 
microglia (n = 4 wild-type and 8 ATTAC cultures for each treatment with Tukey’s multiple 


t s.e.m. *P < 0.05; ***P < 0.001 (one-way ANOVA 
comparisons test (b, d)). Exact P values can be found 


group, 2 independent experiments). c, GFAP staining of primary in the accompanying Source Data. 
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Extended Data Fig. 8 | Senescent cells promote gliosis. a, RT-qPCR 
analysis of Gfap, S100b and Cd11b in the hippocampi of six-month-old 
male mice (n=5 mice per group; normalized to the ATTAC - AP group). 
b, RT-qPCR analysis as in a in hippocampi of six-month-old female mice 
(number of mice as indicated; normalized to the ATTAC - AP group). 

c, Representative GFAP immunohistochemistry staining in the 
hippocampus of six-month-old vehicle and AP-treated ATTAC and 


PS19;ATTAC —AP 


PS19;ATTAC +AP 


PS19;ATTAC female mice (n =4 mice per group, 2 independent 
experiments). d, Representative IBA1 staining in the hippocampus of six- 
month-old vehicle and AP-treated ATTAC and PS19;ATTAC female mice 
(n=4 mice per group, 2 independent experiments). Scale bar, 100 j1m (c) 
and 50 tm (d). Data are mean +s.e.m. *P < 0.05; **P< 0.01; ***P< 0.001 
(one-way ANOVA with Tukey’s multiple comparisons test). Exact P values 
can be found in the accompanying Source Data. 
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Extended Data Fig. 9 | AP treatment attenuates tau phosphorylation. 
a, Ponceau-S loading controls for the western-blot lysates of the whole 
brain of six-month-old mice (shown in Fig. 3a) for total tau (left) and 
phosphorylated tau (S202/T205; right). b, Quantification of the western- 
blot analysis of the whole brain of six-month-old mice for soluble tau 
(left), soluble phosphorylated tau (S202/T205; middle), and insoluble 
phosphorylated tau (S202/T205; right). Numbers of biologically 


independent mice are indicated in parentheses, data are from >3 
independent experiments. c, Immunostaining of the cortex of six-month- 
old mice for tau protein phosphorylated at T231 (top) and $396 (bottom; 
n=4 mice per group, 2 independent experiments). Scale bar, 100 jum. Data 
are mean + s.e.m. ***P < 0,001 (one-way ANOVA with Tukey’s multiple 
comparisons test). Exact P values can be found in the accompanying 
Source Data. 
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Extended Data Fig. 10 | Vision-based novel-object discrimination 
remains intact in AP-treated PS19;ATTAC mice. Objects used for 
novel-object recognition during the training and testing phases for visual 
discrimination (left) and the average ratio for the number of investigations 
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(right, n = 8 female mice per group). Data are mean +s.e.m. **P <0.01; 
*** P< 0.001 (two-way ANOVA with Tukey’s multiple comparisons test). 
Exact P values can be found in the accompanying Source Data. 
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Temporal development of the gut microbiome in 
early childhood from the TEDDY study 
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The development of the microbiome from infancy to childhood is 
dependent on a range of factors, with microbial-immune crosstalk 
during this time thought to be involved in the pathobiology of later 
life diseases!~® such as persistent islet autoimmunity and type 1 
diabetes!*!?, However, to our knowledge, no studies have performed 
extensive characterization of the microbiome in early life in a large, 
multi-centre population. Here we analyse longitudinal stool samples 
from 903 children between 3 and 46 months of age by 16S rRNA gene 
sequencing (n= 12,005) and metagenomic sequencing (n= 10,867), 
as part of the The Environmental Determinants of Diabetes in the 
Young (TEDDY) study. We show that the developing gut microbiome 
undergoes three distinct phases of microbiome progression: a 
developmental phase (months 3-14), a transitional phase (months 
15-30), and a stable phase (months 31-46). Receipt of breast milk, 
either exclusive or partial, was the most significant factor associated 
with the microbiome structure. Breastfeeding was associated with 
higher levels of Bifidobacterium species (B. breve and B. bifidum), 
and the cessation of breast milk resulted in faster maturation of the 
gut microbiome, as marked by the phylum Firmicutes. Birth mode 
was also significantly associated with the microbiome during the 
developmental phase, driven by higher levels of Bacteroides species 
(particularly B. fragilis) in infants delivered vaginally. Bacteroides 
was also associated with increased gut diversity and faster 
maturation, regardless of the birth mode. Environmental factors 
including geographical location and household exposures (such as 
siblings and furry pets) also represented important covariates. A 
nested case-control analysis revealed subtle associations between 
microbial taxonomy and the development of islet autoimmunity or 
type 1 diabetes. These data determine the structural and functional 
assembly of the microbiome in early life and provide a foundation 
for targeted mechanistic investigation into the consequences of 
microbial-immune crosstalk for long-term health. 

In this study, a total of 12,500 stool samples from 903 children from 
three European countries (Germany, Sweden and Finland) and three 
US states (Colorado, Georgia and Washington) were analysed. The chil- 
dren represent those who seroconverted to islet cell autoantibody pos- 
itivity or developed type 1 diabetes (T1D) and matched controls. Stool 
samples were collected, on average, monthly from around 3 months of 
age as part of the The Environmental Determinants of Type 1 Diabetes 
in the Young (TEDDY) study'’. After rarefaction and limiting sam- 
ples to 3-46 months of age, we analysed the microbiome (16S rRNA 
gene sequencing, n = 12,005 samples from 903 children; metagenomic 


sequencing, n= 10,867 samples from 783 children) and functional 
metagenome (metagenomic sequencing only) from longitudinal stool 
samples (Extended Data Table 1). A companion paper by Vatanen 
et al.“ focused exclusively on metagenomic sequencing data. 

In this cohort of children that are at-risk for developing islet auto- 
immunity (IA) or T1D, we aimed to (1) characterize definitively the 
longitudinal gut microbiome development from 3 to 46 months of age; 
(2) determine selected maternal and postnatal influences on the developing 
bacterial community during this same time period of early development; 
and (3) use a nested case-control analysis to investigate the potential of the 
microbiome as a predictor for the development of IA or T1D. 

A general overview of bacterial taxonomic and functional pathway 
development is provided in Supplementary Note 1 and Extended Data 
Fig. 1. Dirichlet multinomial mixtures (DMM) modelling was applied 
to 16S rRNA gene sequencing (Fig. 1) and metagenomic sequencing 
data (Extended Data Fig. 2). All samples from 3 to 46 months of age 
were included, and 16S rRNA gene sequencing profiles formed ten 
clusters (based on lowest Laplace approximation) (Fig. 1a). Bacterial 
richness and diversity increased in each cluster (Fig. 1a, b). Using linear 
mixed-effects modelling of the top five phyla and Shannon's diversity 
index, we determined three distinct phases of microbiome progression: 
a developmental phase (months 3-14), a transitional phase (months 
15-30), and a stable phase (>31 months), in which all five phyla and 
the Shannon diversity index changed significantly during the devel- 
opmental phase, two phyla (Proteobacteria and Bacteroidetes) and the 
Shannon diversity index changed significantly during the transitional 
phase, and all phyla and the Shannon diversity index were unchanged 
during the stable phase (Fig. 1c). Bifidobacterium dominated during the 
initial developmental phase, in which 20% of individuals transitioned 
from cluster 1 to cluster 3 (Bifidobacterium was dominant in both 
clusters). As infants aged, the microbiomes of their stools diversified 
into clusters 4-8 during months 15-30 (that is, the transitional phase). 
Microbiome stabilization, in which infants’ samples remained in the 
same cluster at consecutive time points, was observed from month 31 
of life. Clusters 8-10 were the most dominant during the stable phase, 
with these clusters characterized by high alpha diversity and dominance 
of genera within the Firmicutes phyla. The three microbiome phases 
and changes in taxa are consistent with other cohorts!>-'8 and were 
supported by the metagenomic sequencing data (Supplementary Note 2 
and Extended Data Fig. 2). 

We next sought to determine the significant factors associated with 
the microbiome profiles from 16S rRNA gene sequencing (genus level), 
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Fig. 1 | DMM clustering of 16S rRNA gene sequencing data (n = 12,005). 
The entire dataset formed ten distinct clusters based on lowest Laplace 
approximation. a, Heat map showing the relative abundance of the 25 
most dominant bacterial genera per DMM cluster. Taxa names in square 
brackets are in need of formal taxonomic revision. b, Box plots showing 
the alpha diversity (richness and Shannon's diversity) per each DUM 
cluster. The centre line denotes the median, the boxes cover the 25th and 
75th percentiles, and the whiskers extend to the most extreme data point, 
which is no more than 1.5 times the length of the box away from the box. 
Points outside the whiskers represent outlier samples. c, Transition model 
showing the progression of samples through each DMM cluster per each 


metagenomic sequencing taxa (species level), and functional metabolic 
capacity (Kyoto encyclopedia of genes and genomes (KEGG) mod- 
ules) (Supplementary Table 1). For statistical analysis, covariates were 
analysed by stratifying the samples into discrete time points (months 
3-6, 7-10, 11-14, 15-18, 19-22, 23-26, 27-30 and 31-40), and only 
the first sample from each infant was included. Information about 
the underlying grouping of each covariate is shown in Extended Data 
Table 1. Several covariates were significantly associated with the genus 
and species level bacterial community profiles between months 3 and 
18 of age, particularly at the first time point of 3 to 6 months (Fig. 2). 
Conversely, bacterial metabolic potential was associated exclusively 
with the consumption of breast milk from months 3 to 14 of life (Fig. 2). 

Breastfeeding explained the greatest amount of variance from months 
3 to 14 of life, after which only 10% of infants received any breast milk 
(Fig. 2). Breastfeeding had a comparable influence on microbiome 
development, regardless of whether it was exclusive or together with 
formula milk and/or solids (Fig. 3a). At the genus level, the receipt 
of breast milk was most significantly associated with Bifidobacterium 
throughout each time window (Supplementary Table 2). At the spe- 
cies level, breastfeeding was significantly associated with 121 different 
bacterial species, with higher levels of B. bifidum, B. breve, B. den- 
tium, Lactobacillus rhamnosus and Staphylococcus epidermidis, and 
lower levels of Escherichia coli, Tyzzerella nexilis, Eggerthella lenta, 
Ruminococcus torques and Roseburia intestinalis in infants that were 


584 | NATURE | VOL 562 | 25 OCTOBER 2018 


DMM cluster 4.0 
Bifidobacterium 

Enterococcus 08 
Veillonella 

Streptococcus 

Enterobacter 

Escherichia 

Bacteroides 

Akkermansia 

[Ruminococcus] gnavus group 
Erysipelatoclostridium 
Tyzzerella 02 
Lachnoclostridium 

Blautia mo 
Lachnospiraceae UCG-008 

Anaerostipes 

[Eubacterium] rectale group 

Clostridium sensu stricto 1 

Romboutsia 

Intestinibacter 

Fusicatenibacter 

Ruminococcus 

Faecalibacterium 

Subdoligranulum 

[Eubacterium] coprostanoligenes group 
Dialister 


Relative abundance 


| i NAM IL Hl Il MH) 


mi ji \ i 


uN ls 


15 27 31 35 39 43 
: to es to to to to to 
30 34 38 42 46 = 


Phyla 


Firmicutes 
336 273 200 153 84 64 Proteobacteria 
H Developmental phase [JP < 0.001 


Bacteroidetes 
Verrucomicrobia 


i MP <0.001 cere 
' MP < 0.001 Diversity 
1 MP = 0.003 H’ Shannon 
i IP <0.001 
1 H’P <0.001 ss 
i Transition 
; Transitional phase JP = 0.041 frequency 
~m | MIP = 0.006 00% 
H’P.<0.001 02% 
4% 
B 6% 
ci 8% 
) 0, 
ee S) table phase = be 
' B 14% 
, B 16% 
1 B 18% 
H B 20% 
aoa ited bein 1 @ 22% 
i @ 24% 
' 
1 
1 


time point, from months 3 to 46 of life. Dashed boxes show the three 
phases of microbiome progression (developmental, transitional and stable 
phase). Solid squares next to the labels denote the significant changes 

in phyla and Shannon's diversity (H') per phase based on multiple linear 
regression. All phyla and the H' were significant in the developmental 
phase, two phyla and the H' were significant in the transitional phase, and 
no phyla or the H' were in the stable phase. Nodes and edges are sized 
based on the total counts. Nodes are coloured according to DMM cluster 
number and edges are coloured by the transition frequency. Transitions 
with less than 4% frequency are not shown. Results are further supported 
by the metagenomic sequencing data in Extended Data Fig. 2. 


breastfed (a full list of significant taxa and associated P values are shown 
in Supplementary Table 2). Bifidobacterium spp. and Lactobacillus 
spp. exist viably in breast milk and Staphylococcus spp. colonize the 
areolar skin, thus these species can be directly transferred from the 
mother to infant!?-*”. B. longum was not significantly associated with 
breastfeeding and remained in higher relative abundance compared to 
other Bifidobacterium spp. (Fig. 3b). In the companion manuscript by 
Vatanen et al.'*, most B. longum strains were found to contain genes 
from the human milk oligosaccharide (HMO) gene cluster, whereas 
after the cessation of breast milk, most B. longum strains no longer 
carried these genes. This potentially reflects the ability of B. longum 
subsp. infantis and subsp. longum to use mammalian- and plant-derived 
oligosaccharides, respectively?*4. B. bifidum also persisted after the 
cessation of breastfeeding, and this species is able to switch HMO to 
mucin degradation”‘. Vatanen et al.’* show experimentally that B. breve, 
B. longum and B. bifidum, which make up DMM clusters 1-3 (Extended 
Data Fig. 2), have distinct profiles of sugar utilization, suggesting that 
the different nutrient availability between infants can promote the col- 
onization of specific Bifidobacterium species. 

As the infant ages, the proportion of solid foods in the diet increases 
(and the amount of breast milk decreases)". In the current study, the 
Shannon diversity index between infants receiving some breast milk 
and infants no longer receiving breast milk began to converge over 
time, probably as a result of a reduced proportion of breast milk in the 
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Fig. 2 | Significance and explained variance of 22 microbiome 
covariates modelled by EnvFit across all data types. Horizontal bars 
show the amount of variance (r”) explained by each covariate in the model 
as determined by EnvFit. The groups within each covariate are detailed in 
Extended Data Table 1. Covariates are coloured based on overall metadata 
group. Significant covariates (false discovery rate (FDR) P< 0.05) are 
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diet and therefore less dominance of Bifidobacterium (Fig. 3c). Infants 
receiving some breast milk had significantly lower diversity when com- 
pared with infants no longer receiving breast milk across all phases 
(P <0.001 for all phases), owing to the dominance of Bifidobacterium 
in infants receiving breast milk. To explore microbiome maturation 
further, we used microbiota age and microbiota-by-age Z-scores 
(MAZ) as previously described”, with a model of 20 operational taxo- 
nomic units (OTUs) that explained 72% of the variance (compared to 
74% when including all OTUs in the model) (Extended Data Fig. 3). 
Comparably, the microbiota age and MAZ scores were significantly 
reduced in infants receiving some breast milk in the developmental 
and transitional phases (both P< 0.001 for microbiota age and MAZ 
scores), but converged in the stable phase (microbiota age P= 0.331 
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represented in bold font. Asterisk denotes the significant covariates at each 
time point. BMI, body mass index; wtgain, weight gain. a, Microbiome 
profiles at the genus level based on 16S rRNA gene sequencing data 
(n=4,069). b, Microbiome profiles at the species level based on 
metagenomic sequencing (n = 3,843). c, Functional metagenomic capacity 
at the module level based on metagenomic sequencing (n = 3,843). 


and MAZ score P=0.196) (Fig. 3d). After the cessation of breast milk, 
110 unique bacterial species (89 from the Firmicutes phylum) were sig- 
nificantly increased from months 3 to 14 of life alone (Supplementary 
Table 2). The suppression of Firmicutes while in receipt of some breast 
milk was recently noted”!. Together, these data support existing reports 
that the maturation of the gut microbiome is driven by the cessation of 
breast milk (rather than the introduction of solid foods), hallmarked 
by increased levels of Firmicutes'”?!”°, 

Breastfeeding was the only covariate that was significantly associ- 
ated with metabolic potential (Fig. 2). Plotting all significant modules 
(Supplementary Table 1) from the first three time points (months 
3-14) showed clear clustering based on the receipt of breast milk, 
with comparability in the metabolic capacity regardless of the time 
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Fig. 3 | Breastfeeding status was the most significant microbiome 
covariate associated with all datasets throughout the first year of life. 
Breastfeeding status was significantly associated with microbiome profiles 
over the first three time points (months 3-14, n = 2,257; Supplementary 
Table 1). Curves show locally weighted scatterplot smoothing (LOESS) 
for the data per category, and shaded areas show permutation-based 95% 
confidence intervals for the fit. a, Non-metric multidimensional scaling 
(NMDS) ordination plots showing the mean centroid of each breastfeeding 
status group. Plots include only the first sample obtained from a patient 
within a given time point; months 3-6, 7-10 and 11-14. Centroid size 
based on number of samples and the bars represent the +95% confidence 
interval. b, Plots showing the receipt of breast milk from months 3 to 40 


point (Fig. 3e). Modules most significantly associated with breastfed 
infants were from the ‘carbohydrate and lipid metabolism’ pathway 
and included ‘fatty acid biosynthesis’ (M00083 and M00082) and ‘beta- 
oxidation, acyl-CoA synthesis’ (M00086) (Supplementary Table 2). 
This is in accordance with previous work that found that genes that 
relate to the biosynthesis of fatty acids are increased during infancy in 
breastfed infants'”*”?8, Conversely, infants not receiving breast milk 
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of age compared to the relative abundance of the six most abundant 
Bifidobacterium species over the same period (n= 11,717). c, Longitudinal 
Shannon diversity index from months 3 to 40 of age (1 = 11,717). 

d, Longitudinal development of the microbiome maturation based on the 
microbiota age and MAZ score against the age of the infant at sampling 
(n=11,717). e, Heat map showing the mean abundance of all significant 
modules as determined by MaAsLin analysis at each of the first three time 
points. The corresponding pathway for each module is also presented. BM, 
breast milk. f, Stacked bar plots showing the abundance of each significant 
module binned at the pathway level. Abundance plotted per bacterial 
species, with the five most significant species associated with breastfed and 
non-breastfed infants, respectively. 


showed rapid turnover of the metabolic capacity, and the ‘dicarboxylate- 
hydroxybutyrate cycle’ (M00374) and ‘reductive acetyl-CoA (Wood- 
Ljungdahl)’ (M00377) pathways were increased. Modules relating 
to vitamins B7 (‘nucleotide and amino acid metabolism pathway; 
M00573, M00577 and M00123) were also increased in all time points 
up to 14 months in non-breastfed infants, a function that is associated 
with the adult microbiome”®. 
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By mapping reads with genomic coordinates that overlap with known 
KEGG orthologues to KEGG modules (M), we were able to directly 
determine from which taxa each gene orthology (and thus module) 
was derived (see Methods). Each pathway from which each significant 
module was derived was plotted against the main species discriminat- 
ing breastfeeding status (Supplementary Table 2). In breastfed infants, 
B. breve accounted for the highest number of significant modules in 
early life, and was replaced by B. bifidum after 6 months of life (Fig. 3f). 
In non-breastfed infants, E. coli primarily accounted for the signifi- 
cant modules between 3 and 14 months of life (Fig. 3f). This provides 
further evidence that the gut microbiome rapidly matures after the 
cessation of breast milk, both at the taxonomic and functional levels. 

The TEDDY study was powered to detect microbiome associations 
with the development of IA and T1D based on a specific 1:1 nested 
case-control study design, from two nested case-control studies (IA 
or T1D), using risk set sampling”’. The analytical cohort consisted of 
a subset with an equal number of samples for each case-control pair. 
The IA cohort consisted of 632 children and 6,194 stool samples and 
the T1D cohort consisted of 196 children and 1,540 stool samples, as of 
31 May 2012 (Supplementary Table 3). The temporal alpha diversity 
(both richness and Shannon's diversity), microbiota age and MAZ 
scores were comparable between cases and matched controls for both 
the IA and T1D groups (all P > 0.05; Extended Data Fig. 4a—h). The 
relative abundance of the top 50 most abundant genera from 16S 
rRNA gene sequencing showed only subtle compositional differences, 
with higher relative abundance of an unclassified Erysipelotrichaceae 
(P=0.019) in cases of [A (Supplementary Table 3). In the T1D and 
control cohort, five bacterial genera were associated with T1D onset, 
with Parabacteroides the most significant (P < 0.001). Eleven bac- 
terial genera were lower in T1D cases, including four unclassified 
Ruminococcaceae, Lactococcus (P=0.020), Streptococcus (P=0.032), 
and Akkermansia (P=0.045) (Supplementary Table 3). 

Conditional logistic regression models showed no significant associa- 
tions between either the numbers of unique states exhibited or the num- 
ber of transitions between different states per subject for IA (Extended 
Data Fig. 4i and Supplementary Table 3). The lack of associations was 
consistent in T1D, with the exception that cases exhibited fewer unique 
states 6-12 months before the onset of T1D (P=0.032) (Extended Data 
Fig. 4j and Supplementary Table 3). Notably, the 6-12 months before 
T1D onset group consisted of the lowest number of samples for any of 
the time points (1 = 67 subjects per group), and thus the statistically 
significant result should be interpreted with caution. Overall, the con- 
ditional logistic regression models of community dynamics suggest that 
microbiome stability was not strongly related to the onset of IA or T1D. 

Further analysis of covariates that were significant at several time 
points and/or consistently significant by 16S rRNA gene sequencing 
and metagenomics are presented in Supplementary Note 3. In brief, 
birth mode was significantly associated with microbiome develop- 
ment over the first year of life, with higher levels of Bacteroides spp. 
in infants that were delivered vaginally (Extended Data Fig. 5). This 
was generally consistent across the different breast milk exposure 
groups and geographical locations (Extended Data Fig. 6). Differences 
between geographical locations occurred from 3 to 22 months of life 
(Supplementary Table 1), although the core microbiome was consist- 
ent (Supplementary Table 4), and diversity, microbiota age and MAZ 
scores had comparable trajectories across each location (Extended Data 
Fig. 7a-c). Household exposures (for example, living with siblings and 
with furry pets) were also associated with differences in the microbi- 
ome profiles in early life, in which infants living with siblings and/or 
with furry pets showed accelerated rates of maturation of the microbi- 
ome (Extended Data Fig. 7d-i). 

The TEDDY population offers a robust analysis of gut microbiome 
development of 903 infants from months 3 to 46 of age, with regular 
sampling (more than 12,000 stool samples), extensive metadata, and 
the use of both amplicon and metagenomic sequencing. We showed 
that the first year of life is a key phase for the development of the 
microbiome, with the receipt of breast milk being the main factor that 
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influences microbiome development over this period. Birth mode, 
geographical location, household siblings and furry pets were also 
associated with the microbiome over this period. We considered the 
first year of life as developmental, the second year of life as transitional, 
and from year three of life the microbiome stabilized. These precise 
ages may shift when investigators include samples before month 3 or 
beyond month 46 of life. 

The current cohort is largely white, non-Hispanic and is drawn from 
a population of infants at high genetic risk for T1D, some of whom 
developed autoimmunity or diabetes. Temporal alpha diversity and 
community dynamics were comparable between cases and controls, 
which is in contrast to findings reported in other cohorts and may 
reflect the increased number of subjects and samples in the TEDDY 
cohort'?!?. We found subtle changes in the relative abundance of bac- 
terial genera between cases (IA and/or T1D) and matched controls. 
T1D cases showed higher levels of Streptococcus sp. and Lactococcus 
sp., which is consistent with the findings of Vatanen et al. 14 in the com- 
panion paper. In accordance with previous work, the abundance of 
Akkermansia was also higher in controls in the current study, which 
may be indicative of enhanced gut integrity’®. 

The overall microbiome development and significant covariates 
are in concordance with previous reports in westernized populations, 
although caution should be exercised when extrapolating the findings 
from the TEDDY cohort of children with risk factors of developing 
T1D to the wider population. Nevertheless, the significant covariates 
reported in the current study have been independently linked to the 
risk of later life diseases such as obesity, asthma and allergy'*. The 
current study provides several testable hypotheses of microbiome 
development in infancy, and it remains important to determine the 
potential mechanism of altered early life microbiome and the subse- 
quent effect on immune development and functioning. With a more 
comprehensive understanding of the crucial early life phases and their 
effect on health and disease, lifestyles and therapeutics can be tailored 
to support optimal microbial-immune homeostasis. 


Online content 
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METHODS 

Study population. The TEDDY Study is composed of six clinical research cen- 
tres: three in the United States (Colorado, Georgia/Florida and Washington), and 
three in Europe (Finland, Germany and Sweden). Children enrolled are followed 
prospectively from three months to 15 years with study visits every three months 
until age 4 years and every three or six months thereafter depending on autoan- 
tibody positivity. Stool samples and associated metadata were collected as of 
31 May 2012. Stool samples were collected monthly from 3 to 48 months of life, 
then every three months until the age of 10 years, and then biannually thereafter, 
into the three plastic stool containers provided by the clinical centre. Children who 
were antibody negative after 4 years of age were encouraged to submit four times 
a year even though after 4 years their visits schedule switched to biannual. Parents 
sent the stool containers at either ambient or +4°C temperature with guaranteed 
delivery within 24 h in the appropriate shipping box to the NIDDK repository if 
living in the United States or their affiliated clinical centre if living in Europe. The 
European clinical centres stored the stool samples and sent monthly bulk shipments 
of frozen stool to the NIDDK repository. The population (both cases and controls) 
is based on children at high risk for T1D based on their HLA genotype with 10% 
based on family history in addition to HLA. Detailed study design and methods 
have been previously published’*”?°, Matching factors for case and control 
children were geographical location, sex and family history of T1D. 

Metadata were collected using validated questionnaires that have been either 
published or extensively scrutinized by experts. Information about mothers, preg- 
nancy and birth was collected during the three month clinic visit by question- 
naire and included the mode of birth (vaginal birth versus Caesarean section), the 
infant’s 5-min Apgar score, pregnancy complications, information about maternal 
diabetes (T1D, type 2 diabetes (T2D) or gestational diabetes), gestational age, and 
maternal medication use (insulin, metformin, glyburide, antihypertensives) during 
pregnancy. TEDDY provides many tools, such as “The TEDDY book; to the parents 
to assist in real-time collection of all events in their child’s life to ensure bias and 
error are minimized. At each visit the study personnel will go over the TEDDY 
book with the primary caretaker and extract pertinent information using stand- 
ardized study forms. Data are extracted by trained staff members during scheduled 
visits every three months starting at 3 months of age and entered directly via stand 
forms (web forms or teleforms), which are transmitted electronically. Front-end 
constraints are used in the web application to prevent the entry of invalid data and 
The TEDDY Error Reporting and Verification System (ERVS) consists of a set of 
programs that conduct automated quality control on the data, report and resolve 
errors, an integrated database for storing error data, and a set of programs that 
generate reports for monitoring data cleaning efforts. The details of the system 
have been published*". Given the prospective nature of the TEDDY design, infor- 
mation and recall bias are greatly minimized. Because the children do not have 
event outcome at time of enrolment and are followed, there is no reason for any 
systematic differences between groups of the study participants in the accuracy of 
the information collected. 

The TEDDY study was approved by local US Institutional Review Boards 
and European Ethics Committee Boards in Colorado's Colorado Multiple 
Institutional Review Board, Georgia's Medical College of Georgia Human 
Assurance Committee (2004-2010), Georgia Health Sciences University Human 
Assurance Committee (2011-2012), Georgia Regents University Institutional 
Review Board (2013-2015), Augusta University Institutional Review Board 
(2015-present), Florida’s University of Florida Health Center Institutional Review 
Board, Washington state's Washington State Institutional Review Board (2004— 
2012) and Western Institutional Review Board (2013-present), Finland’s Ethics 
Committee of the Hospital District of Southwest Finland, Germany’s Bayerischen 
Landesarztekammer (Bavarian Medical Association) Ethics Committee, Sweden's 
Regional Ethics Board in Lund, Section 2 (2004-2012) and Lund University 
Committee for Continuing Ethical Review (2013-present). All parents or guard- 
ians provided written informed consent before participation in genetic screening 
and enrolment. The study was performed in compliance with all relevant ethical 
regulations. 

A priori power calculations using discrete Cox’s proportional hazards regres- 
sion” for the matched IA case-control study estimated 80% power, a= 0.01, two- 
sided test to detect an odds ratio > 3 for an exposure with 5% prevalence to an 
odds ratio > 1.8 for an exposure with 20% prevalence. The experiments were not 
randomized, and investigators were not blinded to allocation during experiments 
and outcome assessment. 
16S rRNA gene sequencing. 16S rRNA gene sequencing methods were adapted 
from the methods developed by the NIH-Human Microbiome Project and the 
Earth Microbiome Project***°. Bacterial DNA was extracted using the PowerMag 
Microbiome DNA isolation kit following the manufacturer's instructions. The V4 
region of the 16S rRNA gene was amplified by PCR and sequenced on the MiSeq 
platform (Illumina) using the 2 x 250 bp paired-end read protocol. The read 
pairs were demultiplexed and reads were merged using USEARCH v7.0.1090*. 
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Merging allowed zero mismatches and a minimum overlap of 50 bases, and merged 
reads were trimmed at the first base with a q < 5. A quality filter was applied 
to the resulting merged reads and those containing above 0.5% expected errors 
were discarded. Sequences were stepwise clustered into OTUs at a similarity cut- 
off value of 97% using the UPARSE algorithm*”. Chimeras were removed using 
USEARCH v7.0.1090 and UCHIME v4.2. To determine taxonomies, OTUs were 
mapped to a version of the SILVA Database** containing only the 16S V4 region 
using USEARCH v7.0.1090. Abundances were recovered by mapping the merged 
reads to the UPARSE OTUs. A custom script constructed a rarefied OTU table 
from the output files generated in the previous two steps for downstream analyses 
of taxonomic relative abundance, alpha diversity, and beta diversity (including 
UniFrac)*?. A total of 114,313,601 reads (median 8,442 reads per sample) were 
obtained from 16S rRNA gene sequencing and each sample was rarefied to 3,000 
reads. Stringent merging parameters account for the relatively low number of 
OTUs, with the number of species by metagenomics around fourfold higher than 
the number of OTUs by 16S rRNA gene sequencing. 

Metagenomic shotgun sequencing. Individual libraries constructed from each 
sample were pooled and loaded onto the HiSeq 2000 platform (Illumina) and 
sequenced using the 2 x 100 bp paired-end read protocol. The process of quality 
filtering, trimming, and demultiplexing was carried out by in-house pipeline 
developed by assembling publicly available tools such as Casava v1.8.2 (Illumina) 
for the generation of fastqs, Trim Galore v0.2.8 (http://www.bioinformatics. 
babraham.ac.uk/projects/trim_galore/) and cutadapt v1.9dev2 for adaptor and 
quality trimming, and PRINSEQ v0.20.5"° for sample dereplication and low com- 
plexity filtering. In addition, Bowtie2 v2.2.3" was used to map reads to a database 
containing complete genomes and assemblies for bacteria, viruses, human, and 
vectors in the NCBI whole-genome sequencing (WGS) archive (as of March 2015). 
Reads in which the highest identity matches were not bacterial were removed from 
subsequent analysis. The edit distance (Levenshtein distance) was used to deter- 
mine the score of the alignments to the reference genomes”. For bacterial reads, 
the highest scoring match (greater than 90%) was chosen per read considering 
only the top 25 highest scoring alignments. In the event of multiple identical top 
scoring hits, the lowest common ancestor was determined. 

Reads in which the genomic coordinates overlap with known KEGG ortho- 
logues*?4 were tabulated, and KEGG modules were calculated step-wise and 
determined to be complete if 65% of the reaction steps were present per detected 
species and for the metagenome. Pathways were constructed for each taxa and 
metagenome by calculating the minimum set through MinPath* resulting from 
the gene orthologues present. A total of 19,967,936,136 reads (median 1,606,240 
reads per sample) were obtained from metagenomic sequencing and for subse- 
quent analysis each sample was rarefied to 100,000 reads. 

Statistical analysis. The analysis was conducted in two parts: (1) characterize 
the longitudinal maturation of the microbiome and (2) determine the significant 
covariates that influence microbiome development. For both parts of analysis, 
alpha diversity (richness and Shannon diversity) was calculated at the OTU-level 
for 16S rRNA gene sequencing and species-level for metagenomics data. Alpha 
diversity and taxonomic abundance were modelled using LOESS regression, 
and implemented and plotted with 95% confidence intervals in R (http://www. 
R-project.org) using the ggplot package”. 

DMM clustering. The first part of the analysis determined the key phases of micro- 
biome progression, which included the use of DMM. DMM bins samples on the 
basis of microbial community structure*’. The appropriate number of clusters was 
determined based on the lowest Laplace approximation score. For this specific 
analysis, samples up to month 46 of life were included, whereas all other analyses 
included samples up to month 40 of life. Including the additional samples here 
allowed for more accurate determination of the microbiome phases. 

The second part of the analysis sought to determine the significant covariates in 
shaping the microbiome profiles at discrete time points and further ascertain the 
significantly altered taxa based on samples up to month 40 of life. The framework 
for the statistical analysis considered the longitudinal nature of the dataset and 
accounted for the dynamic nature of the covariates. Owing to the potential that 
some covariates might influence the microbiome before the start date (for example, 
underlying indication for an antibiotic prescription) and some covariates will alter 
the microbiome for an unknown time frame (for example, microbiome disrupted 
by antibiotics may continue to be altered months after treatment), covariates were 
classified as ‘before’ ‘during; or ‘after’. In the case a covariate was negative for an 
infant, all samples would be classified as ‘never’. In instances in which several onsets 
of a covariate were possible (for example, multiple antibiotic start and end time 
points), after the first onset the covariate was classified as ‘after’ for the remaining 
samples, unless another event occurred, in which case ‘during’ would be applied 
where appropriate according to the start and stop dates. Analysis was performed 
at specific time windows, including samples collected between months 3-6, 7-10, 
11-14, 15-18, 19-22, 23-26, 27-30 and 31-40. Only the first sample collected from 
a given child was included in each time window to account for repeated measures. 


© 2018 Springer Nature Limited. All rights reserved. 
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EnvFit analysis to determine significant covariates. The effect size and significance 
of each covariate were determined using the ‘envfit’ function in ‘vegar’ (https:// 
cran.r-project.org/web/packages/vegan/index.html) comparing the difference in 
the centroids of each group relative to the total variation. Ordination was per- 
formed using NMDS based on Bray—Curtis dissimilarity. The significance value 
was determined based on 10,000 permutations. All P values derived from envfit 
were adjusted for multiple comparisons using FDR adjustment (Benjamini- 
Hochberg procedure)“. In total, 22 covariates with known associations to gut 
microbiome development in neonates, infants, and children were included in 
the envfit analysis and the grouping used for within each variable is presented in 
Extended Data Table 1. Specifically, we tested maternal factors including diabe- 
tes (gestational, T1D, T2D or none)*’, diabetes medication (insulin, metformin, 
glyburide, antihypertensives)*°, BMI°?””, gestational weight gain category (excess 
or non-excess)**, preeclampsia®’, maternal probiotic consumption™, as well as 
offspring factors such as prematurity'*°, birth mode!*!”°°, gender™, receipt of 
breast milk and/or formula!”*3°”-°”, introduction of solid foods™*!, geographical 
location?’, probiotics”, vitamin D supplementation®, antibiotics!*, household 
siblings**™, household furry pets**®, living on a farm with animals”, day-care 
exposure®, coeliac disease, acute disease, and chronic disease”. 

MaAsLin analysis to determine significant taxa associated with each covariate. 
MaAsLin was used for adjustment of covariates when determining the signifi- 
cance of taxa (genus level for 16S rRNA gene sequencing and species level for 
metagenomic sequencing) contributing to a specific variable, while accounting 
for potentially confounding covariates”. In brief, this multivariate linear mod- 
elling system for microbial data selects from among a set of (potentially high- 
dimensional) covariates to associate with microbial taxon or pathway abundances. 
Mixed-effects linear models using a variance-stabilizing arcsin square root trans- 
form on relative abundances are then used to determine the significance of putative 
associations from among this reduced set. Nominal P values across all associations 
are then adjusted using the Benjamini-Hochberg FDR method. Here, microbial 
features with corrected q < 0.25 were reported. All 22 covariates tested in the envfit 
were included in the adjustment regardless of significance by envfit. Subject age 
was also included to adjust for potential age driven changes in taxa within each 
three-month time window and IA and T1D outcome were included to adjust for 
the nested case control nature of the cohort. The default MaAsLin parameters 
were applied (maximum percentage of samples NA in metadata 10%, minimum 
percentage relative abundance 0.01%, P< 0.05, q< 0.25). All P values were adjusted 
for multiple comparisons using FDR“. 

Microbiota maturation modelling and linear mixed-effects analysis. The random 
forest regression model’! was performed as previously described”, using the ‘ran- 
domForest’ R package”. In brief, the model was trained on 150 randomly selected 
full term (>37 weeks gestation), vaginally delivered, breastfed infants who had a 
minimum of 10 samples included in the final dataset. The model was built using 
the default parameters: growing 10,000 trees and n/3 OTUs randomly sampled 
at each split, in which n represents the number of OTUs. The model was further 
refined by applying ‘rfcv’ with tenfold cross-validation resulting in the inclusion of 
20 OTUs to train the final model based on percentage increase in mean-squared 
error. These 20 OTUs explained 72% of the total variance of the model (compared 
to 75% with all OTUs included). The age of the subject predicted by this model was 
termed microbiota age and was further used to determine MAZ scores using the 
formulae described preiously”*. Significant differences in alpha diversity, microbi- 
ota age, and MAZ scores were calculated using linear mixed-effects models in R, 
with the ‘Imer’ command within the ‘Ime4’ package’. We included random slopes 
and intercept for individual children, and evaluated delivery mode, age, Bacteroides 
positive or negative, predominant diet, geographical location, presence of 
siblings, and presence of household pets as fixed effects. To perform these piecewise 
longitudinal models, we divided samples into the three developmental phases 
(<14 months, >15-<30 months, and >31 months). Owing to the relatively low 
number of samples in the exclusive and never breastfed groups, the analysis of 
breast milk status was conducted based on ‘some breast milk’ or ‘after breast milk’ 
with these groups found to cluster with exclusive and never breastfed, respectively. 
Determination of the datasets for IA and T1D nested case-control stability analyses. 
The development of persistent confirmed IA was assessed every three months. 
Persistent autoimmunity was defined by the presence of confirmed islet autoanti- 
body on two or more consecutive visits. The date of persistent autoimmunity was 
defined as the draw date of the first sample of the two consecutive samples that 
deemed the child persistent confirmed positive for a specific autoantibody (or 
any autoantibody). T1D was defined according to American Diabetes Association 
criteria for diagnosis”. A dataset with equal numbers of cases and control samples 
was created to preform conditional logistic regression of summary metric variables 
(that is, counting for each person the number of unique clusters exhibited and 
the number of temporal transitions between different clusters). On average, cases 
tended to have more samples than controls, and therefore had more transitions 
and observed states, which resulted in spurious associations between our metrics 


and disease outcome. For this purpose, we created a dataset in which case and 
control samples were matched to the paired case based on the nearest sample by 
day of life (unmatched sample or sample outside of +20% were omitted from anal- 
yses). This resulted in an analytical cohort of 316 IA cases and 316 paired controls 
(n=3,097 stool samples in each group) and 98 T1D cases and 98 paired controls 
(n= 1,270 stool samples in each group). For consistency, we used these datasets 
for all matched case-control analyses. The IA and T1D analysis was based on 16S 
rRNA gene sequencing data only and analysis of the metagenomic sequencing data 
(that is, species level taxonomic profiling and functional capacity) are presented 
in the companion paper". 

Taxonomic and metabolic profiling relative to IA onset in the matched case-control 
dataset. 16S rRNA gene sequencing data was used to determine differences 
between alpha diversity (number of OTUs (richness) and Shannon's diversity 
index), microbiota age, and MAZ scores. Significant differences in alpha diver- 
sity, microbiota age, and MAZ scores were calculated using linear mixed-effects 
models in R, with the ‘mer’ command within the ‘Ime4’ package’. To perform 
these piecewise longitudinal models, we divided samples into the three develop- 
mental phases (<14 months, >15-<30 months, and >31 months). Conditional 
logistic regression of matched case-control pairs was performed on the top 50 most 
dominant bacterial genera from samples prior to disease diagnosis. Odds ratios 
were calculated with 95% confidence intervals, adjusted for potential confounding 
variables, including age at sample collection, HLA genotype, mode of delivery, and 
duration of breastfeeding. Abundance information for genera was entered into the 
model as log)-transformed read counts. A value of 0.01 was added to avoid 0 s. The 
Benjamini—Hochberg procedure was applied to correct for multiple comparisons“ 
and corrected P < 0.05 was considered significant. 

Assessment of microbiome instability based on DMM clusters between IA or T1D 
cases and controls. For each subject, the total number of clusters exhibited through- 
out sampling per infant and the number of transitions between different clusters 
from one sample to the next were calculated to provide summary measures of 
microbiome stability over time. These summary metrics were then used in con- 
ditional logistic regression to assess the relationship of microbiome stability with 
IA and T1D. Odds ratios were calculated with 95% confidence intervals, adjusted 
for potential confounding variables, including HLA genotype, mode of delivery, 
duration of breastfeeding, number of antibiotic courses, and number of infectious 
episodes. 

Reporting summary. Further information on research design is available in the 
Nature Research Reporting Summary linked to this paper. 

Code availability. Code for the transition model showing the progression of 
samples through each DMM cluster, which are presented in Fig. 1 and Extended 
Data Fig. 2, has been made publicly available at https://github.com/StewartLab/ 
Stewart_TEDDY_Microbiome_Analysis. Other analysis software including quality 
control, taxonomic, and functional profilers is publicly available and referenced 
as appropriate. 


Data availability 

TEDDY microbiome 16S rRNA gene sequencing and metagenomic sequencing data 
that support the findings of this study have been deposited in the NCBI database of 
Genotypes and Phenotypes (dbGaP) with the primary accession code phs001443. 
v1.p1, in accordance with the dbGaP controlled-access authorization process. 
Clinical metadata analysed during the current study will be made available in the 
NIDDK Central Repository at https://www.niddkrepository.org/studies/teddy. 
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Extended Data Fig. 1 | Characterization of the gut microbiome over 

the first 40 months of life (n = 11,717). a-d, 16S rRNA gene sequencing 
(a-c) and metagenomic sequencing (d) analysis. Curves show LOESS fit 
for the data per category, and shaded areas show permutation-based 95% 
confidence intervals for the fit. a, Summary of overall dietary status. b, The 
mean alpha diversity (richness and Shannon diversity) per child increased 


rapidly from 3 to 20 months of life. c, The mean relative abundance of the 
five most abundant bacterial phyla show changes from 3 to 20 months of 
life and generally remain stable after month 30 of life. d, The mean relative 
abundance of the ten most abundant bacterial pathways shows relative 
stability, with ABC transporters and two-component system showing the 
largest reduction from 3 to 20 months of life. 
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Extended Data Fig. 2 | DMM clustering of metagenomic sequencing 
data (n = 10,867). The entire dataset formed 18 distinct clusters based 
on lowest Laplace approximation. a, Heat map showing the relative 
abundance of the 25 most dominant bacterial species per each DMM 
cluster. b, Box plots showing the alpha diversity (richness and Shannon's 
diversity) for each DMM cluster. The centre line shows the median, the 
boxes cover the 25th and 75th percentiles, and the whiskers extend to 
the most extreme data point, which is no more than 1.5 times the length 
of the box away from the box. Points outside the whiskers represent 
outlier samples. c, Transition model showing the progression of samples 
through each DMM cluster per each time point, from months 3 to 46 


of life. Dashed boxes show the three phases of microbiome progression 
(developmental, transitional and stable phase). Solid squares next to the 
labels denote the significant changes in phyla and Shannon diversity 

(H’) per phase based on multiple linear regression. All phyla and the H' 
were significant in the developmental phase, two phyla and the H' were 
significant in the transitional phase, and no phyla or the H' were in the 
stable phase. Nodes and edges are sized based on the total counts. Nodes 
are coloured according to DMM cluster number and edges are coloured 
by the transition frequency. Transitions with less than 2% frequency were 
omitted from the plot. 
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Extended Data Fig. 3 | Twenty bacterial OTUs classified by random 
forest regression analysis as most age discriminatory over the first 
40 months of life. Rank importance of OTUs determined by applying 
the random forest regression to the chronological age of 150 full-term, 
vaginally delivered, breastfed infants (n = 2,871 stool samples). The 
importance of OTUs is determined by the percentage increase in mean- 
squared error of microbiota age prediction when the relative abundance 
of each OTU were randomly permuted (mean importance + s.d., 
n= 100 replicates). These selected OTUs explained 72% of the variance 
(compared to 75% variance explained with all OTUs in model) and were 
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used to define maturation of the gut microbiome by microbiota age and 
MAZ score. OTUs are named to the genus level and coloured based on 
association with life stage; blue were associated with samples collected 
in the first 15 months, green with samples collected between months 15 
and 30, and red were with samples collected after month 30. a, Twenty 
OTUs ranked by importance to the accuracy of the model. The tenfold 
cross-validation error is also displayed in order of variable importance. 
Blue dotted line represents the 20 OTUs used in the model. b, Heat map 
of mean relative abundance of the 20 selected OTUs per month from 3 to 
40 months of age. 
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Extended Data Fig. 4 | The microbiota was not associated with the 
development of persistent IA and T1D. Data are based on 16S rRNA 
gene sequencing (n = 11,717). Analysis based on a nested 1:1 case-control 
cohort of equal samples. Curves show LOESS fit for the data per category, 
and shaded areas show permutation-based 95% confidence intervals for 
the fit. a, b, The number of OTUs (a) and the Shannon's diversity index (b) 
in the IA cohort. c, d, The number of OTUs (c) and Shannon’s diversity (d) 
in the T1D cohort. e, f, Microbiota age (e) and MAZ score (f) in the IA 
cohort. g, h, Microbiota age (g) and MAZ score (h) in the T1D cohort. 

i, j, Forest plot showing the odds ratios for the association between the 


2.55 
2 g 2.05 
cS) =o 
& 404 c 
2 2154 
=} 6 
Zz Pa 
204 === Control mmm 71D ® 404 === Control mam 71D 
fo 20 30 40 10 20 30 40 
Age in months Age in months 
g h 2:5 
@ 257 
= 
6 0.04 
£ 204 
o N 
aD q —2.54 
© 454 
8 15 S 
je} 
2 -5.04 
8 104 
= m= Control mam 71D -~7.54 ™™= Control mm T1D 
5 : : : : T : : — 
10 20 30 40 10 20 30 40 
Subject age (months) Subject age (months) 
. 
o 
® O-Inf 4 
fo) 
ia) 
pt 12-Inf} 4 
2 
2 
@ 6-12 4 
no 
= 
5 0-64 | 
= 
: : + : : + 7 
0.5 1.0 1.5 2.0 0.4 0.8 1.2 1.6 


Odds ratio (+/- 95% confidence interval) 


microbiome stability metrics and development of IA (i) and T1D (j). 

A separate conditional logistic regression was run for four time intervals: 
(1) birth to onset; (2) 12 months before onset; (3) 6-12 months before 
onset; and (4) 6 months before onset. Models were adjusted for HLA 
genotype, mode of delivery, duration of exclusive breastfeeding, number of 
antibiotic courses, and number of infectious episodes. Community states 
are the total number of unique clusters exhibited by an infant and state 
transitions are the number of transitions between clusters. No odds ratio 
was significantly different between cases and controls (Supplementary 
Table 3). 
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Extended Data Fig. 6 | The relative abundance of Bacteroides stratified 
by breast milk and geographical location. Curves show LOESS fit for 

the data per category, and shaded areas show permutation-based 95% 
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mixed-effects models in accordance with observed phases of maturation: 
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Extended Data Table 1 | Overview of the entire analytical cohort 


Month Total cohort 3 to6 7 to 10 ll to 14 15 to 18 
Number of Subjects 903 810 800 647 543 
Number of Samples 12,005 810 800 647 543 

No. samples per subject (IQR) 11 (6-19) 1 1 1 1 


Median age of samples in months (IQR) 13.8 (8-22.2) 4.1 (3.8-4.5) 7.7 (7.2-8.1) 11.5 (11.2-12) 15.6 (15.2-16.1) 
Maternal BMI category* 


19 to 22 
440 

440 

1 

19.7 (19.2-20.2) 


23 to 26 

336 

336 

1 

23.8 (23.3-24.2) 


27 to 30 

273 

273 

1 

27.7 (27.2-28.2) 


LETTER 


31 to 40 

220 

220 

1 

31.6 (31.2-32.3) 


Underweight 326 (3%) 27 (3%) 25 (3%) 17 (3%) 15 (3%) 11 (3%) 9 (3%) 6 (2%) 5 (2%) 
Normal 7209 (60%) 478 (59%) 479 (60%) 379 (59%) 328 (60%) 265 (60%) 204 (61%) 165 (60%) 140 (64%) 
Overweight 2991 (25%) 197 (24%) 189 (24%) 162 (25%) 132 (24%) 114 (26%) 92 (27%) 75 (27%) 55 (25%) 
Obese 1386 (12%) 100 (12%) 101 (13%) 83 (13%) 64 (12%) 47 (11%) 28 (8%) 25 (9%) 18 (8%) 
NA 93 (1%) 8 (1%) 6 (1%) 6 (1%) 4 (1%) 3 (1%) 3 (1%) 2(1%) (0%) 
Maternal preg. weight gain* 
Non-excess 8497 (71%) 584 (72%) 579 (72%) 473 (73%) 386 (71%) 308 (70%) 236 (70%) 193 (71%) 151 (69%) 
Excess 3415 (28%) 218 (27%) 215 (27%) 168 (26%) 153 (28%) 129 (29%) 97 (29%) 78 (29%) 67 (30%) 
NA 93 (1%) 8 (1%) 6 (1%) 6 (1%) 4(1%) 31%) 31%) 2 (1%) 2(1%) 
Maternal diabetes medication* 
Insulin 977 (8%) 66 (8%) 64 (8%) 56 (9%) 41 (8%) 28 (6%) 23 (7%) 23 (8%) 19 (9%) 
Metformin and insulin 38 (0%) 3 (0%) 2 (0%) 1 (0%) 1 (0%) 1 (0%) 1 (0%) 1 (0%) 1(0%) 
Glyburide 126 (1%) S5(%) 6 (1%) 6 (1%) 51%) 3 (1%) 4 (1%) 4(1%) 3 (1%) 
None 10864 (90%) 736 (91%) 728 (90%) 584 (90%) 496 (91%) 400 (91%) 304 (90%) 245 (90%) 197 (90%) 
Maternal diabetes* 
Gestational 779 (6%) 56 (7%) 53 (7%) 43 (7%) 36 (7%) 29 (7%) 21 (6%) 16 (6%) 16 (7%) 
None 10060 (84%) 673 (83%) 672 (84%) 535 (83%) 458 (84%) 368 (84%) 280 (83%) 229 (84%) 185 (84%) 
TID 839 (7%) 53 (7%) 54 (7%) 49 (8%) 35 (6%) 29 (7%) 25 (7%) 22 (8%) 16 (7%) 
T2D 30 (0%) 30%) 3 (0%) 20%) 20%) 1 (0%) 0 (0%) 0. (0%) 00%) 
NA 297 (2%) 25 (3%) 18 (2%) 18 (3%) 12 (2%) 13 (3%) 10 (3%) 6 (2%) 3 (1%) 
Maternal preeclampsia* 
Yes 445 (4%) 35 (4%) 32 (4%) 23 (4%) 21 (4%) 17 (4%) 11 (3%) 9 (3%) 9 (4%) 
No 11439 (95%) 765 (94%) 757 (95%) 618 (96%) 516 (95%) 418 (95%) 322 (96%) 262 (96%) 210 (95%) 
NA 121 (1%) 10 (1%) 11 (1%) 6 (1%) 6 (1%) S(1%) 3 (1%) 2 (1%) 1 (0%) 
Maternal probiotic* 
Yes 584 (5%) 43 (5%) 42 (5%) 30 (5%) 24 (4%) 21 (5%) 17 (5%) 14 (5%) 9 (4%) 
No 11421 (95%) 767 (95%) 758 (95%) 617 (95%) 519 (96%) 419 (95%) 319 (95%) 259 (95%) 211 (96%) 
Birth mode 
Caesarian 3319 (28%) 196 (24%) 200 (25%) 170 (26%) 146 (27%) 121 (28%) 88 (26%) 84 (31%) 67 (30%) 
Vaginal 8686 (72%) 614 (76%) 600 (75%) 477 (74%) 397 (73%) 319 (73%) 248 (74%) 189 (69%) 153 (70%) 
Preterm (<37 weeks) 
Yes 723 (6%) 39 (5%) 43 (5%) 35 (5%) 30 (6%) 29 (7%) 20 (6%) 16 (6%) 16 (7%) 
No 11282 (94%) 771 (95%) 757 (95%) 612 (95%) 513 (94%) 411 (93%) 316 (94%) 257 (94%) 204 (93%) 
Geographical location 
Colorado 2079 (17%) 122 (15%) 120 (15%) 99 (15%) 84 (15%) 57 (13%) 45 (13%) 39 (14%) 30 (14%) 
Finland 2833 (24%) 223 (28%) 215 (27%) 163 (25%) 131 (24%) 107 (24%) 85 (25%) 57 (21%) 40 (18%) 
Georgia 872 (7%) 46 (6%) 53 (7%) 50 (8%) 48 (9%) 33 (8%) 23 (7%) 22 (8%) 18 (8%) 
Germany 1219 (10%) 86 (11%) 78 (10%) 60 (9%) 46 (8%) 46 (10%) 34 (10%) 28 (10%) 16 (7%) 
Sweden 4024 (34%) 260 (32%) 262 (33%) 216 (33%) 190 (35%) 150 (34%) 119 (35%) 102 (37%) 99 (45%) 
Washington 978 (8%) 73 (9%) 72 (9%) 59 (9%) 44 (8%) 47 (11%) 30 (9%) 25 (9%) 17 (8%) 
Sex 
Male 6455 (54%) 497 (61%) 438 (55%) 357 (55%) 297 (55%) 230 (52%) 175 (52%) 146 (53%) 119 (54%) 
Female 5550 (46%) 372 (46%) 362 (45%) 290 (45%) 246 (45%) 210 (48%) 161 (48%) 127 (47%) 101 (46%) 
Race/Ethnicity** 
African Americans 24 (0%) 2 (0%) 2 (0%) 1 (0%) 1 (0%) 1 (0%) 1 (0%) 2(1%) 0 (0%) 
Hispanic 445 (4%) 32 (4%) 32 (4%) 29 (4%) 22 (4%) 12 (3%) 10 (3%) 4(1%) 3 (1%) 
White Non-Hispanic 7391 (62%) 512 (63%) 497 (62%) 395 (61%) 322 (59%) 272 (62%) 201 (60%) 161 (59%) 199 (90%) 
All Other Races 128 (1%) 6 (1%) 8 (1%) 6 (1%) 8 (1%) 4 (1%) S(1%) 5 (2%) 21%) 
NA 4017 (33%) 258 (32%) 261 (33%) 216 (33%) 190 (35%) 151 (34%) 119 (35%) 101 (37%) 16 (37%) 
Breast milk status 
No breast milk ever 220 (2%) 13 (2%) 15 (2%) 12 (2%) 11 (2%) 7(2%) 7(2%) 8 (3%) 3 (1%) 
Exclusive 240 (2%) 136 (17%) 4 (1%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 
Some breast milk 3048 (25%) 445 (55%) 417 (52%) 170 (26%) 52 (10%) 27 (6%) 10 (3%) 7 (3%) 5 (2%) 
After breast milk 8497 (71%) 216 (27%) 364 (46%) 465 (72%) 480 (88%) 406 (92%) 319 (95%) 258 (95%) 212 (96%) 
Solid food status 
Before 602 (5%) 373 (46%) 4(1%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 
After 11403 (95%) 437 (54%) 796 (100%) 647 (100%) 543 (100%) 440 (100%) 336 (100%) 273 (100%) 220 (100%) 
Probiotic 
Never 8017 (67%) 516 (64%) 508 (64%) 432 (67%) 363 (67%) 288 (65%) 222 (66%) 184 (67%) 149 (68%) 
Before 1183 (10%) 155 (19%) 117 (15%) 63 (10%) 45 (8%) 27 (6%) 18 (5%) 9 (3%) 10 (5%) 
During 2296 (19%) 122 (15%) 153 (19%) 136 (21%) 115 (21%) 107 (24%) 78 (23%) 57 (21%) 42 (19%) 
After 509 (4%) 17 (2%) 22 (3%) 16 (2%) 20 (4%) 18 (4%) 18 (5%) 23 (8%) 19 (9%) 
Vitamin D 
Never 692 (6%) 48 (6%) 46 (6%) 39 (6%) 30 (6%) 21 (5%) 17 (5%) 13 (5%) 9 (4%) 
During 11197 (93%) 756 (93%) 747 (93%) 602 (93%) 508 (94%) 413 (94%) 313 (93%) 257 (94%) 208 (95%) 
Afier 116 (1%) 6 (1%) 7 (1%) 6 (1%) 5 (1%) 6 (1%) 6 (2%) 3 (1%) 3 (1%) 
Antibiotics 
Never 3796 (32%) 241 (30%) 240 (30%) 198 (31%) 166 (31%) 138 (31%) 106 (32%) 83 (30%) 78 (35%) 
Before 2358 (20%) 415 (51%) 287 (36%) 152 (23%) 63 (12%) 11 (3%) 0 (0%) 0 (0%) 0 (0%) 
During 288 (2%) 19 (2%) 26 (3%) 24 (4%) 18 (3%) 12 (3%) 0 (0%) 0 (0%) 0 (0%) 
After 5563 (46%) 135 (17%) 247 (31%) 273 (42%) 296 (55%) 279 (63%) 230 (68%) 190 (70%) 142 (65%) 
Household siblings 
Never 2187 (18%) 168 (21%) 164 (21%) 131 (20%) 108 (20%) 79 (18%) 53 (16%) 33 (12%) 25 (11%) 
Before 2890 (24%) 167 (21%) 171 (21%) 147 (23%) 131 (24%) 115 (26%) 87 (26%) 76 (28%) 58 (26%) 
During 6877 (57%) 468 (58%) 460 (58%) 367 (57%) 303 (56%) 245 (56%) 194 (58%) 163 (60%) 136 (62%) 
After 7 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 2(1%) 1 (0%) 1 (0%) 
NA 44 (0%) 71%) 5%) 2 (0%) 1 (0%) 1 (0%) 0 (0%) 0 (0%) 0 (0%) 
Household flurry pets 
Never 6001 (50%) 418 (52%) 414 (52%) 329 (51%) 275 (51%) 227 (52%) 166 (49%) 133 (49%) 107 (49%) 
Before 779 (6%) 50 (6%) 45 (6%) 39 (6%) 36 (7%) 30 (7%) 25 (7%) 20 (7%) 15 (7%) 
During 5160 (43%) 336 (41%) 337 (42%) 278 (43%) 232 (43%) 183 (42%) 141 (42%) 116 (42%) 96 (44%) 
After 40 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0(0%) 4(1%) 4 (1%) 2(1%) 
NA 25 (0%) 6 (1%) 4 (1%) 1 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 
Lives on farm with animals 
Never 10385 (87%) 692 (85%) 688 (86%) 559 (86%) 474 (87%) 379 (86%) 289 (86%) 238 (87%) 193 (88%) 
Before 478 (4%) 35 (4%) 31 (4%) 26 (4%) 25 (5%) 19 (4%) 14 (4%) 10 (4%) 6 (3%) 
During 1085 (9%) 71 (10%) 77 (10%) 60 (9%) 43 (8%) 42 (10%) 31 (9%) 22 (8%) 19 (9%) 
After 32 (0%) 0 (0%) 0 (0%) 1 (0%) 1 (0%) 0 (0%) 2(1%) 3(1%) 2(1%) 
NA 25 (0%) 6 (1%) 4 (1%) 1 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 
Daycare exposure 
Yes 4600 (38%) 49 (6%) 115 (14%) 139 (21%) 237 (44%) 263 (60%) 227 (68%) 194 (71%) 165 (75%) 
No 7405 (62%) 761 (94%) 685 (86%) 508 (79%) 306 (56%) 177 (40%) 109 (32%) 79 (29%) 55 (25%) 
Celiac disease 
Yes 506 (4%) 27 (3%) 28 (4%) 25 (4%) 22 (4%) 19 (4%) 12 (4%) 13 (5%) 10 (5%) 
No 11499 (96%) 783 (97%) 772 (97%) 622 (96%) 521 (96%) 421 (96%) 324 (96%) 260 (95%) 210 (95%) 
Any chronic disease/disorder 
Never 8937 (74%) 622 (77%) 604 (76%) 486 (75%) 411 (76%) 333 (76%) 247 (74%) 195 (71%) 161 (73%) 
Before 1815 (15%) 154 (19%) 136 (17%) 102 (16%) 72 (13%) 54 (12%) 37 (11%) 32 (12%) 20 (9%) 
During 1212 (10%) 33 (4%) 58 (7%) 57 (9%) 59 (11%) 52 (12%) 51 (15%) 43 (16%) 37 (17%) 
After 41 (0%) 1 (0%) 2 (0%) 2 (0%) 1 (0%) 1 (0%) 1 (0%) 3 (1%) 2(1%) 
Any acute disease/disorder 
Never 17 (0%) 4 (0%) 3 (0%) 0 (0%) 0(0%) 0 (0%) 0 (0%) 0 (0%) 0(0%) 
Before 1469 (12%) 599 (74%) 121 (15%) 18 (3%) 3(1%) 1 (0%) 0 (0%) 0 (0%) 0 (0%) 
During 5107 (43%) 165 (20%) 321 (40%) 308 (48%) 265 (49%) 191 (43%) 151 (45%) 135 (49%) 96 (44%) 
After 5412 (45% 42 (5% 355 (44%) 321 (50%) 275 (51% 248 (56%) 185 (55%) 138 (51% 124 (56%) 


IQR, interquartile range; NA, not available. 
*Maternal variables relate to measurements obtained during pregnancy. 
*kRace/ethnicity was only available for the US sites. 
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The human gut microbiome in early-onset type 1 
diabetes from the TEDDY study 


Tommi Vatanen!*, Eric A. Franzosa!?, Randall Schwager”, Surya Tripathi’, Timothy D. Arthur!, Kendra Vehik*, Ake Lernmark‘, 
William A. Hagopian®, Marian J. Rewers®, Jin-Xiong She’, Jorma Toppari®’, Anette-G. Ziegler!°!-", Beena Akolkar®, 
Jeffrey P. Krischer?, Christopher J. Stewart!*>, Nadim J. Ajami", Joseph F. Petrosino™, Dirk Gevers!!°, Harri Lahdesmaki"®, 


Hera Vlamakis!, Curtis Huttenhower!2:2°* & Ramnik J. Xavier 


Type 1 diabetes (T1D) is an autoimmune disease that targets 
pancreatic islet beta cells and incorporates genetic and 
environmental factors!, including complex genetic elements’, 
patient exposures* and the gut microbiome’. Viral infections® 
and broader gut dysbioses® have been identified as potential 
causes or contributing factors; however, human studies have not 
yet identified microbial compositional or functional triggers that 
are predictive of islet autoimmunity or T1D. Here we analyse 
10,913 metagenomes in stool samples from 783 mostly white, non- 
Hispanic children. The samples were collected monthly from three 
months of age until the clinical end point (islet autoimmunity or 
T1D) in the The Environmental Determinants of Diabetes in the 
Young (TEDDY) study, to characterize the natural history of the 
early gut microbiome in connection to islet autoimmunity, T1D 
diagnosis, and other common early life events such as antibiotic 
treatments and probiotics. The microbiomes of control children 
contained more genes that were related to fermentation and 
the biosynthesis of short-chain fatty acids, but these were not 
consistently associated with particular taxa across geographically 
diverse clinical centres, suggesting that microbial factors associated 
with T1D are taxonomically diffuse but functionally more coherent. 
When we investigated the broader establishment and development 
of the infant microbiome, both taxonomic and functional profiles 
were dynamic and highly individualized, and dominated in the 
first year of life by one of three largely exclusive Bifidobacterium 
species (B. bifidum, B. breve or B. longum) or by the phylum 
Proteobacteria. In particular, the strain-specific carriage of genes 
for the utilization of human milk oligosaccharide within a subset 
of B. longum was present specifically in breast-fed infants. These 
analyses of TEDDY gut metagenomes provide, to our knowledge, 
the largest and most detailed longitudinal functional profile of the 
developing gut microbiome in relation to islet autoimmunity, T1D 
and other early childhood events. Together with existing evidence 
from human cohorts”* and a T1D mouse model’, these data support 
the protective effects of short-chain fatty acids in early-onset human 
TID. 

Recent literature has linked several facets of gut health with the 
onset of T1D in humans and rodent models**". Altered intestinal 
microbiota in connection to T1D has been reported in Finnish?®!)12, 
German", Italian'*, Mexican!’, American (Colorado)"® and Turkish!” 
children. Common findings include increased numbers of Bacteroides 


1,17,18, 20 


species, and deficiency of bacteria that produce short-chain fatty 
acids (SCFAs)”* in cases of T1D or islet autoimmunity (IA)®1!1>!8, 
Corroborating these findings, decreased levels of SCFA-producing 
bacteria were found in adults with type 2 diabetes (T2D)”. In addi- 
tion, increased intestinal permeability!* and decreased microbial diver- 
sity’? after IA but before T1D diagnosis have been reported. Studies 
using the nonobese diabetic (NOD) mouse model have determined 
immune mechanisms that mediate the protective effects of SCFAs” 
and the microbiome-linked sex bias in autoimmunity”. NOD mice 
fed specialized diets resulting in high bacterial release of the SCFAs 
acetate and butyrate were almost completely protected from T1D®. A 
study in a streptozotocin-induced T1D mouse model demonstrated 
that bacterial products recognized in pancreatic lymph nodes contribute 
to pathogenesis”!. 

Even in the absence of immune perturbation, the first few weeks, 
months and years of life represent a unique human microbial environ- 
ment that has only recently been detailed”””>, Infants have a markedly 
different gut microbial profile from adults, characterized by a dis- 
tinct taxonomic profile, greater proportion of aerobic energy harvest 
metabolism, and more extreme dynamic change”. These differences 
gradually fade over the first few years of life, particularly in response to 
the introduction of solid food, and individual microbial developmental 
trajectories are influenced by environment, delivery mode, breast (versus 
formula) feeding, and antibiotics**-?”. Most studies that address the 
development of the gut microbiome, both generally and in associa- 
tion with T1D, have used gene analysis of 16S rRNA, which leaves 
open the question of functional and strain-specific differences that are 
not easily detected by this technology that might contribute to disease 
pathogenesis!”. 

Bridging this gap is one goal of the The Environmental Determinants 
of Diabetes in the Young (TEDDY) study, a prospective study that 
aims to identify environmental causes of T1D*. It includes six clinical 
research centres in the United States (Colorado, Georgia/Florida and 
Washington) and Europe (Finland, Germany and Sweden), which 
together have recruited several thousand newborns with a genetic 
predisposition for T1D or first-degree relative(s) with T1D. This has 
enabled the TEDDY study to collect a range of biospecimens, including 
monthly stool samples starting at three months of age, coupled with 
extensive clinical and personal data such as diet, illnesses, medications 
and other life experiences. To characterize microbial, environmental, 
genetic, immunological and additional contributors to the development 
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Fig. 1 | More than 10,000 longitudinal gut metagenomes from the 
TEDDY T1D cohort. We analysed 10,913 metagenomes collected 
longitudinally from 783 children (415 controls, 267 seroconverters, and 
101 diagnosed with T1D) approximately monthly over the first five years 
of life. a, Subjects were recruited at six clinical centres (Finland, Sweden, 
Germany, Washington, Georgia and Colorado). Primary end points were 
seroconversion (defined as persistent confirmed IA) and T1D diagnosis. 


of T1D, the TEDDY study group further assembled nested case-control 
studies for IA (n= 418 case-control pairs) and T1D (n= 114)”. Case- 
control pairs were matched by clinical centre, sex and family history of 
TID, which are all known confounding factors for T1D susceptibility 
and microbiome composition. 

Here, we assessed 783 children followed from three months to up 
to five years of age from six clinical centres in four countries (Finland, 
Germany, Sweden and the United States) who either progressed to 
persistent IA or T1D or were matched as controls (Fig. 1a, b, Extended 
Data Table 1). Stool samples were collected, on average, monthly start- 
ing at three months of age and continuing until the clinical end point 
(IA or T1D). This study focused solely on analysing metagenomic 
sequencing data (n= 10,903 samples, n = 783 subjects), while a com- 
panion paper by Stewart et al.*° interrogated corresponding 16S rRNA 
amplicon sequencing information. 

We first investigated the taxonomic composition of early gut metage- 
nomes at the species level. Principal coordinate analysis ordination 
of Bray—Curtis beta diversities showed a strong longitudinal gradient 
and marked heterogeneity among the earliest samples (Fig. 2a, 
Extended Data Fig. la-k, Supplementary Note 1). Permutational anal- 
ysis of variance (ANOVA) of Bray—Curtis beta diversities indicated 
that inter-subject differences explained 35% of microbial taxonomic 
variation (permutation test, P< 0.001, 1,000 permutations), followed 
by age at stool sampling at roughly 4% of variance (P< 0.001). Using 
cross-sectional analysis to test for associations between taxonomic 
beta diversities and other collected metadata, we found that in addi- 
tion to subject ID and age, geographical location and breastfeeding 
had strong and systematic effects on the composition of the micro- 
bial community (Supplementary Table 1, Extended Data Fig. 2a—d, 
Supplementary Note 1). To investigate the stability and individual- 
ity of the microbial profiles further, we compared intra- and inter- 
subject Bray—Curtis beta diversities. The gap between individual 
stability and similarity within or across clinical centres was largest at 
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Additional metadata analysed for subjects and samples included the status 
of breastfeeding, birth mode, probiotics, antibiotics, formula feeding, 

and other dietary covariates. b, Overview of stool samples collected and 
microbiome development as summarized by Shannon's alpha diversity and 
stratified by end point. Median number of samples per individual n= 12 
(healthy controls n= 10, seroconverters n = 13, T1D cases n= 16). 


the beginning of the sampling period, indicating that the children had 
particularly dissimilar microbiota during these early months (Fig. 2b, 
Supplementary Note 1). Finally, we tested microbial alpha diversity 
(Shannon's diversity index) of taxonomic profiles for associations with 
collected metadata, and found that the cessation of breastfeeding had 
the largest effect (ANOVA, partial 7” =0.053) in the accrual of alpha 
diversity in early life (Supplementary Table 2, Extended Data Fig. 3a-e, 
Supplementary Note 1). 

We next investigated the effects of antibiotics on the early life micro- 
biome. Courses of oral antibiotics disrupted microbial stability, with a 
larger effect in the earliest comparisons (Fig. 2c, Extended Data Fig. 4a-f, 
Extended Data Table 2, Supplementary Note 2). Previous studies have 
found Bifidobacterium species to be especially vulnerable to antibiotics*!*”, 
leading us to investigate how antibiotic perturbations influenced these 
common dominant members of the early gut. Comparing microbial 
relative abundances before and after antibiotics (assuming that the 
given species was present in the preceding sample), we saw a decrease 
in the abundances of the Bifidobacterium members B. bifidum, B. 
pseudocatenulatum, B. adolescentis, B. dentium and B. catenulatum, 
whereas B. longum and B. breve did not systematically decline owing 
to antibiotics (Fig. 2d), suggesting that certain Bifidobacterium species 
are particularly susceptible to out-competition by other community 
members after depletion by antibiotics. Given their dominance in the 
typical developing gut microbiota and finely tuned balance of metabolic 
interactions with breast milk, this finding underscores the importance 
of approaching antibiotic prescriptions in early childhood with care, 
especially during breastfeeding. 

Accompanying our taxonomic profiling, functional profiling of these 
metagenomes suggested the development of a consistent microbial 
functional core during infancy, with a smaller subject-specific variable 
functional pool (Extended Data Fig. 5a, b, Supplementary Note 3). 
As in most microbial community studies**, microbial gene families 
of uncharacterized function made up a substantial fraction of these 
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profiles, averaging roughly 50% based on Gene Ontology* annotations 
(Extended Data Fig. 5c) and more than 90% based on more function- 
ally specific MetaCyc pathways (Extended Data Fig. 5d). We observed 
an increasing longitudinal trend in the proportion of unmapped reads 
(Extended Data Fig. 5e, Pearson’s r=0.318, P< 2.2 x 10~'°). However, 
within the reads that mapped to either microbial pangenomes or known 
protein sequences (the proportion of which decreased with age), we 
saw an increase in the proportion of reads with MetaCyc annotation, 
mainly during the first year (Extended Data Fig. 5f, Pearson r= 0.391, 
P<2.2 x 107'%). This suggests that although the early life microbiome 
is relatively well-covered by current microbial reference genomes, less 
functional and biochemical characterization has been carried out on 
gene families within these microorganisms, which will thus particularly 
benefit from future work. 

In addition to broadly conserved and subject-specific functions, we 
identified a range of microbial metabolic enzymes that consistently 
increased or decreased in abundance over the first year of life, parallel- 
ing shifts in community structure and infant diet (Fig. 3, Supplementary 
Note 3, Supplementary Table 3). For example, the enzyme L-lactate 
dehydrogenase (1.1.1.27), which is well-characterized in Bifidobacteria 
for its role in milk fermentation*®, was among the most consistently 
declining enzymes over this period, notably coinciding with the cessa- 
tion of breastfeeding in many infants (from 73% breastfed at month 3 
to 28% at year 1). Conversely, the enzyme transketolase (2.2.1.1), which 
has been implicated previously** in the metabolism of fibre, was among 
the most consistently increasing enzymes, which also coincided with 
increased incorporation of solid food (a component of 53% of infants’ 
diets at month 3 versus 100% at year 1). Hence, these notable changes 
in community functional potential highlight the unique metabolic 
environment of the early infant gut, and the subsequent transition to 
a more adult-like gut microbiome that is adapted to variable, fermen- 
tative energy sources. 

Combining taxonomic and functional profiles to test for differences 
between cases and controls, we used linear mixed-effects modelling 
and identified a relatively small number of individual taxonomic and 
functional features that were associated with case-control outcome 
(Supplementary Table 4), most with borderline statistical significance 
(false discovery rate (FDR) corrected q-values indicated below). We 
confirmed separation between cases and controls by random forest 
classifiers (Extended Data Fig. 6a, b, Supplementary Note 4). In the 
IA case-control cohort, healthy controls contained higher levels of 
Lactobacillus rhamnosus (q = 0.055), supporting protection against 
IA by early probiotic supplementation’ (Extended Data Fig. 6c, d, 
Supplementary Note 5). IA controls also had more Bifidobacterium 
dentium (q=0.054), whereas IA cases had on average higher abun- 
dance of Streptococcus group mitis/oralis/pneumoniae species 
(q=0.11). In T1D case-control comparisons, controls had higher 
levels of Streptococcus thermophilus (q = 0.078) and Lactococcus lactis 
(q =0.094) species, both common in dairy products, whereas cases 
contained higher levels of species such as Bifidobacterium pseudoca- 
tenulatum (q=0.078), Roseburia hominis (q=0.11) and Alistipes shahii 
(q=0.14). Even though our modelling approach controlled for regional 
differences in clinical centres, we found additional but often weak asso- 
ciations with outcome in some clinical centres when tested separately 
(Supplementary Table 4). Finnish IA cases had more Streptococcus 
group mitis/oralis/pneumoniae species (q=0.0008), IA controls from 
Colorado had more Streptococcus thermophilus (q=0.0059), and 
Swedish IA cases contained more Bacteroides vulgatus (q=0.090). 

Pathways with the highest statistical significance in case-control 
comparisons were related to bacterial fermentation (Supplementary 
Table 4). The superpathway of fermentation (MetaCyc identifier 
PWY4LZ-257) was increased in controls in the T1D cohort (q=0.019) 
and Finnish IA cohort (q = 0.049). SCFAs such as butyrate, acetate and 
propionate are common by-products of bacterial fermentation, and 
butyrate and acetate protected NOD mice against T1D”. Consistently, 
we observed that several bacterial pathways that contribute to the bio- 
synthesis of short-chain fatty acids were increased in healthy controls. 
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Fig. 2 | The early gut microbiome is characterized by early 
heterogeneity of Bifidobacterium species and individualized accrual 

of taxa over time. a, Principal coordinate analysis (PCoA) ordination of 
microbial beta diversities (n = 10,913 samples), measured by Bray—Curtis 
dissimilarity. Arrows show the weighted averages of key taxonomic groups. 
b, Microbiota stability, measured by Bray-Curtis (BC) dissimilarity 

(n= 10,750 samples) in three-month time windows, over two-month 
increments, stratified into three groups: within subject, within clinical 
centre, and between clinical centres. Lines show median values per time 
window. Shaded area denotes the estimated 99% confidence interval. 

Gut microbial communities were highly individual. c, Influence of 
antibiotic (Abx) courses on microbial stability, measured by Bray—Curtis 
dissimilarity over consecutive stool samples (<50 days apart) from 

the same individual during the first three years of life, and stratified 

by whether antibiotics were given between the two samples (n = 654 
observations with antibiotics, n = 6,734 observations without antibiotics). 
Curves show locally weighted scatterplot smoothing (LOESS) for the 

data per category. Shaded areas show permutation-based 95% confidence 
intervals for the fit. d, Decreases in the most common Bifidobacterium 
species in connection to oral antibiotic treatments. Fold change was 
measured between consecutive samples with an antibiotic course between 
them, given that the species in question was present in the first of the 

two samples. Sample size per species (n) indicates the number of sample 
pairs in which the species in question was present in the sample before the 
antibiotic treatment. Bars show bootstrapped mean log,(fold change) (that 
is, decrease), and error bars denote s.d. (n = 1,000 bootstrap samples). 


Among pathways involved in butyrate production, the degradation 
of L-arginine, putrescine and 4-aminobutanoate (ARGDEG-PWY) 
superpathway was increased in T1D controls cohort-wide (q=0.043), 
whereas the fermentation of acetyl coenzyme A to butanoate (PWY- 
5676) was more abundant in the Finnish T1D controls (q=0.053). The 
degradation of acetylene (P161-PWY), which contributes to acetate 
production, was increased in T1D controls cohort-wide (q=0.14), and 
the degradation of L-1,2-propanediol (PWY-7013), which is involved 
in propionate biosynthesis, was higher in the German T1D controls 
(q=0.019). These findings support existing evidence for the protective 
effects of SCFAs in human T1D”* and T2D”’ cohorts and the NOD 
mouse model’. 

As reflected by the community-level analyses, human milk with its 
pro- and prebiotic functions is one of the main factors that deter- 
mine the community composition of the infant gut microbiome. 
Bifidobacterium longum subsp. infantis is a particularly versatile 
degrader of human milk oligosaccharide (HMO) that is often found 
in stool samples collected during breastfeeding**. By following the 
families representing genes in the B. longum subsp. infantis HMO gene 


cluster3?“° in our data, we found that an additional 30 bacterial species 
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Fig. 3 | Consistent changes in enzymatic content of the gut microbiome 
in early life. We identified enzyme families (level-4 Enzyme Commission 
(EC) categories) that exhibited the most consistent within-subject 
changes in total community abundance between the ages of 3 months and 
1 year. The top 20 most consistent increases or decreases are presented 
and stratified according to their top 15 contributing species. Heat map 
values reflect the mean contribution of each species to each enzyme over 


carried at least one homologue with more than 50% sequence iden- 
tity to one or more HMO utilization genes (Supplementary Table 5). 
As expected, many Bifidobacteria carried several homologues, but 
surprisingly three Enterococcus species (E. casseliflavus, E. faecalis and 
E. faecium) also carried seven or more homologues (Supplementary 
Table 5). 

To identify strain-level adaptation similar to B. longum subsp. 
infantis, we further examined whether any of these genes showed con- 
trasting prevalence between samples collected during breastfeeding 
and after weaning, given that the carrier species itself was present. In 
total, 41 gene families were observed more often during breastfeeding 
(Supplementary Table 5, test of proportions, adjusted P< 0.001); most 
(37 out of 41) were carried by B. longum (Fig. 4), and B. pseudoca- 
tenulatum contained four such gene families (Extended Data Fig. 7, 
Supplementary Table 5). In samples with B. longum, this implicated a 
clear strain shift after weaning, when fewer B. longum strains carried 
these genes (Fig. 4). In samples with B. pseudocatenulatum, four gene 
families showed a similar but less contrasting pattern (Extended Data 
Fig. 7). Overall, these observations identify new candidate species that 
contribute to HMO processing or exploitation, and link strain compo- 
sition to specific driving molecular functions that potentially explain 
selective sweeps during microbiome development, in this case specif- 
ically related to breastfeeding. 

Despite ample sample size, scrutiny of the study design, and thor- 
ough statistical analyses, most of the taxonomic and functional signals 
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samples (n = 733 at 3 months; 675 at 1 year; and 382 at 2 years). Values 
reflect units of copies per million (CPM) normalized to total read depth 
(including unmapped reads and reads mapped to gene families lacking 
EC annotation). Rows (enzymes) and columns (species) are clustered 
according to Spearman correlation at 3 months; subsequent years are 
ordered according to clustering at 3 months. 


we detected in case-control comparisons were modest in effect size 
and statistical significance. This could be due to several reasons— 
differences between T1D endotypes, temporally diffuse signals, 
geographical heterogeneity, or lack of stool samples for the first two 
months of life —and these should be considered in future investigations 
(Supplementary Note 6). Furthermore, the data used in these investi- 
gations was composed of samples from the genetically predisposed 
and mostly white, non-Hispanic case-control groups designed into 
the TEDDY study. Results cannot be guaranteed to reflect the whole 
TEDDY cohort or child populations in the respective countries. 
Future targeted approaches to identify subject-specific connections 
between the gut microbiota and T1D pathogenesis may be beneficial, 
particularly given the apparent population-level heterogeneity revealed 
here. For example, laboratory experiments involving dietary factors that 
have been associated with the onset of T1D? may reveal biochemically 
specific signals that are mediated by the microbiome. Different endo- 
types of disease, such as differences in the first appearing autoantibody 
(IAA versus GADA), the number of appearing autoantibodies, the time 
from seroconversion to T1D diagnosis, genetic host risk alleles and eth- 
nic backgrounds, may be characterized by distinct microbial configura- 
tions (Supplementary Note 6). Finally, components of the microbiome 
that were poorly measured in these data may also have crucial roles: 
viruses, fungi, microbial transcription or small-molecule biochemistry. 
By surveying these additional molecular activities by cross-sectional 
analysis and in more detailed longitudinal populations, this study lays 
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Fig. 4 | Bifidobacterium longum strains are characterized by HMO gene 
content and stratified by breastfeeding status. Gene families involved in 
HMO utilization and showing contrasting presence in B. longum genomes 
during breastfeeding (n = 1,584 samples) compared to after weaning 

(n= 3,705 samples). Abundance heat map columns represent stool samples 
in which the relative abundance of B. longum species was more than 10% 


the foundation to identify further gut microbial components that are 
predictive, protective or potentially causal in T1D risk or pathogenesis. 
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METHODS 
Cohort and study design. TEDDY is a prospective cohort study funded by the 


National Institutes of Health with the primary goal to identify environmen- 
tal causes of T1D. It includes six clinical research centres—three in the United 
States (Colorado, Georgia/Florida, Washington) and three in Europe (Finland, 
Germany and Sweden). Detailed study design and methods have been previously 
published”*4!?, Written informed consents were obtained for all study participants 
from a parent or primary caretaker, separately, for genetic screening and partic- 
ipation in a prospective follow-up. The TEDDY study was approved by local US 
Institutional Review Boards and European Ethics Committee Boards in Colorado's 
Colorado Multiple Institutional Review Board, Georgia’s Medical College of 
Georgia Human Assurance Committee (2004-2010), Georgia Health Sciences 
University Human Assurance Committee (2011-2012), Georgia Regents University 
Institutional Review Board (2013-2015), Augusta University Institutional Review 
Board (2015-present), Florida’s University of Florida Health Center Institutional 
Review Board, Washington state’s Washington State Institutional Review Board 
(2004-2012) and Western Institutional Review Board (2013-present), Finland’s 
Ethics Committee of the Hospital District of Southwest Finland, Germany’s 
Bayerischen Landesarztekammer (Bavarian Medical Association) Ethics 
Committee, Sweden's Regional Ethics Board in Lund, Section 2 (2004-2012) and 
Lund University Committee for Continuing Ethical Review (2013-present). The 
study is monitored by External Advisory Board formed by the National Institutes 
of Health. 

This analysis used stool samples and clinical metadata from two nested case- 
control studies (persistent, confirmed IA or T1D) using risk set sampling”. The 
data used here were collected as of 31 May 2012, as a 1:1 match in which one 
control per case of persistent confirmed IA or T1D was selected from the full 
TEDDY cohort. A control was a participant who had not developed persistent, 
confirmed IA or T1D by the time the case to which it was matched had developed 
IA or T1D, within +45 days of the event time. Matching factors were clinical cen- 
tre, sex and family history of T1D to control for differences in geographical area, 
genetic background and in sample or data handling between clinical centres. In all 
case-control comparisons, we removed all case-control pairs in which the control 
later progressed to case status (that is, progressed to IA or T1D). In additional, 17 
subjects with missing information about breastfeeding together with their matched 
pairs were excluded from the case-control comparisons to avoid confounding 
effects from unknown breastfeeding status. 

The development of persistent, confirmed IA was assessed every three months. 
Persistent autoimmunity was defined by the presence of confirmed islet autoanti- 
body on two or more consecutive visits. The date of persistent autoimmunity was 
defined as the draw date of the first sample of the two consecutive samples that 
deemed the child persistently positive for a specific autoantibody (or any autoan- 
tibody). T1D was defined according to American Diabetes Association criteria 
for diagnosis”. 

Stool samples were collected monthly starting at three months of age and con- 
tinuing up until 48 months of age, then every three months until 10 years of age 
and then biannually thereafter, into the three plastic stool containers provided 
by the clinical centre. Children who were antibody negative after 4 years of age 
were encouraged to submit four times a year even though after 4 years their visits 
schedule switched to biannual. Parents sent the stool containers at either ambient 
or +4°C temperature with guaranteed delivery within 24 h in the appropriate ship- 
ping box to the NIDDK repository if living in the United States or their affiliated 
clinical centre if living in Europe. The European clinical centres stored the stool 
samples and sent monthly bulk shipments of frozen stool to the NIDDK repos- 
itory. The TEDDY Manual of Operations, including the stool sample collection 
protocol, can be accessed online at https://repository.niddk.nih.gov/static/studies/ 
teddy/teddy_moop.pdf. 

A priori power calculations using discrete Cox’s proportional hazards regres- 

sion“ for the matched IA case-control study estimated 80% power, a = 0.01, two- 
sided test to detect an odds ratio > 3 for an exposure with 5% prevalence, to an 
odds ratio > 1.8 for an exposure with 20% prevalence. The experiments were not 
randomized, and investigators were not blinded to allocation during experiments 
and outcome assessment. 
Metagenomic sequencing and initial bioinformatics. Samples were metagenom- 
ically sequenced as one library each multiplexed through Illumina HiSeq machines 
using the 2 x 100-bp paired-end read protocol. Samples with limited DNA quantity 
and/or too few high-quality reads were filtered out, resulting in a discrepancy of 
sample frequencies between the metagenomic data and the 16S rRNA amplicon 
sequencing data analysed in the companion paper*’. Casava v1.8.2 (Illumina) 
output initial FASTQ files from the resulting data were processed using cutadapt 
v1.9dev2 for adaptor removal, Trim Galore v0.2.8 (Babraham Bioinformatics) for 
removing low-quality bases and PRINSEQ v0.20.3*° for sample demultiplexing. 
Bowtie2 v2.2.3 was used to map reads to the human genome for decontamination 
before subsequent analysis. 
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Taxonomic and functional profiling by MetaPhlAn2 and HUMAnN2. 
Taxonomic profiling of the metagenomic samples was performed using 
MetaPhlAn2* v2.6.0, which uses a library of clade-specific markers to provide 
pan-microbial (bacterial, archaeal, viral and eukaryotic) quantification at the spe- 
cies level. MetaPhlAn2 was run using default settings. 

Functional profiling was performed with HUMAnN2” v0.9.4. For an input 
metagenome, HUMAnN2 constructs a sample-specific reference database by 
concatenating and indexing the pangenomes of species detected in the sample by 
MetaPhlAn2 (pangenomes are pre-clustered, pre-annotated catalogues of open 
reading frames found across isolate genomes from a given species**). HUMAnN2 
then maps sample reads against this database to quantify gene presence and 
abundance in a species-stratified manner, with unmapped reads further used in 
a translated search against UniRef90 to include taxonomically unclassified but 
functionally distinct gene family abundances. Finally, for community-total, spe- 
cies-stratified, and unclassified gene family abundance, HUMAnN2 reconstructs 
metabolic pathway abundance based on the subset of gene families annotated to 
metabolic reactions (based on reaction and pathway definitions from MetaCyc>). 
Enzyme (level-4 Enzyme Commission (EC) categories) abundances were further 
computed by summing the abundances of individual gene families annotated to 
each EC number based on UniRef90-EC annotations from UniProt®. 
Phenotype and covariate analysis. This study includes extensive collection of 
clinical covariates that cover several aspects of common and rare life events in 
early childhood from infancy up to five years of age. In these analyses, we used 
information that is, according to the literature, of high relevance in terms of gut 
microbiome development. Information about mothers, pregnancy and birth was 
collected during the three-month clinic visit by questionnaire and included the 
mode of birth (vaginal birth versus caesarean section), gestational age, infant's 
5-min Apgar score, information about maternal diabetes (T1D, T2D or gesta- 
tional diabetes) and maternal insulin and medication use (antibiotics, angiotensin- 
converting enzyme inhibitors, metformin, glyburide, antihypertensives) during 
pregnancy. Dietary information used in these analyses includes the start (and end) 
date for the following dietary compounds: breastfeeding, baby formula, cow’s milk, 
gluten, cereals, meat, vegetables and fruits. The start of solid food (anything other 
than breast milk or cow’s milk) was also analysed separately. The T1D-associated 
autoantibodies IAA, GADA and IA2A were analysed from serum samples collected 
at each clinic visit. In addition to IA, defined as persistent, confirmed autoantibody 
seropositivity, we analysed the data in terms of the persistency and cumulative 
frequency of autoantibodies (single or multiple autoantibodies). In TEDDY, all pre- 
scribed antibiotic courses are recorded. We further stratified these data by the type 
of antibiotic in five categories: amoxicillin, penicillin, cephalosporins, macrolide 
and other antibiotics. Information about probiotics covered the dates for starting 
and stopping probiotic supplementation, but not the specific types of probiotics 
used. In addition, sex, information about whether first degree relatives in family 
had T1D, and HLA haplotypes of the subjects were used in these analyses. Subjects 
screened from the general population were identified with high-risk alleles (89%) 
including: DRB1*04-DQA1*03-DQB1*03:02/DRB1*03-DQA1*05-DQB1*02:01 
(DR3/4), DRB1*04-DQA1*03-DQB1*03:02/DRB1*04-DQA1*03-DQB1*03:02 
(DR4/4), DRB1*04-DQA1*03-DQB1*03:02/DRB1*08-DQA1*04-DQB1*04:02 
(DR4/8) and DRB1*03-DQA1*05-DQB1*02:01/DRB1*03-DQA1*05-DQB1*02:01 
(DR3/3), plus six genotypes specific to first-degree relatives”®. 

Principal coordinate analysis (PCoA) ordination was generated using t-distrib- 
uted stochastic neighbour embedding (t-SNE) as implemented in Rtsne package 
in R with Bray—Curtis dissimilarity as the distance measure and perplexity (a free 
parameter) equal to 50. Statistical significance of the trends between early clusters 
and metadata were tested using mixed-effect logistic regression and samples 
collected during the first year of life as follows. The target variable used was a 
binary indicator of whether the relative abundance of the taxon of interest (three 
different Bifidobacterium species or phylum Proteobacteria) was greater than 0.5 
(definition of the cluster). The age of sample collection, mode of delivery, clinical 
centre, breastfeeding status (ongoing/stopped), solid food status (binary variable 
indicating whether solid food was introduced in the diet) and antibiotics status 
(binary variable indicating whether the subject received antibiotics during the last 
30 days) were used as fixed effects, and the subject ID was used as a random effect. 

Associations between microbial feature abundances and clinical outcome were 
determined using MaAsLin™. In brief, this multivariate linear modelling system for 
microbial data selects from among a set of (potentially high-dimensional) covari- 
ates to associate with microbial taxon or pathway abundances. Mixed-effects linear 
models using a variance-stabilizing arcsin square root transform on relative abun- 
dances are then used to determine the significance of putative associations from 
among this reduced set. In the models, subject ID was used as a random effect, 
and the age of sample collection, mode of delivery, clinical centre (for cohort-wide 
comparisons), breastfeeding status (ongoing or stopped), solid food status (binary 
variable indicating whether solid food was introduced in the diet), number of 
sequencing reads and case-control outcome were used as fixed effects. Nominal 
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P values were adjusted using the Benjamini-Hochberg FDR method. Here, 
microbial features with corrected q < 0.25 were reported. For metabolic pathways, 
pseudocount 2° was added to CPM values to stabilize the variation in lowly abundant 
and/or prevalent but highly variable categories, and data were log-transformed. 

As previously described“, to associate microbial diversity with covariates while 
accounting for nonlinear, age-dependent effects, we first fitted a sigmoid function 
(nls function in R) to account for the longitudinal trend. Residuals of this model 
were then used as inputs for a mixed-effect model (glmmPQL function in the 
MASS R package), with subject IDs as random effects to account for repeated 
measurements in the data. Other factors were included in the model as fixed effects, 
and their significance levels were evaluated using P values reported by the model 
(Supplementary Table 2). 

The association between T1D case-control outcome and microbial alpha diver- 
sity in individual clinical centres was tested using a linear mixed-effects model 
(glmmPQL function in MASS R package) on samples 730 days or less before T1D 
diagnosis. In the model, the age at stool sample collection and T1D case-control 
outcome were used as fixed effects, and subject ID was used as a random effect. 
Microbial variance explained by clinical and other covariates. Variance analysis 
was conducted using the adonis function in the vegan R package given a Bray- 
Curtis dissimilarity matrix of the taxonomic profiles and all TEDDY clinical 
metadata listed above. In brief, adonis conducts multivariate ANOVA using the 
dissimilarity matrix (that is, partitions the sums of squares) given the metadata as 
covariates. Statistical significance of the fit was assessed using permutation tests. 
HMO gene homology. The HMO gene cluster homologues between B. longum 
subsp. infantis and multiple taxa were analysed as follows. UniRef90 gene families 
corresponding to the protein sequences in the B. longum subsp. infantis HMO gene 
cluster® (protein sequences Blon_2331-Blon_2361 in NCBI protein sequence data- 
base) were identified by translated BLAST search against ChocoPhlAn pangenome 
collection*® used by HUMAnN2. Identified hits were further filtered by requiring 
>50% alignment identity and >80% mutual coverage. Combining this informa- 
tion with HUMAnN2 species-stratified UniRef90 gene family quantification ena- 
bled calling these genes present given that they had sufficient read coverage, here 
defined as log;9(counts per million) > 0.1 in at least 50 samples collected during 
breastfeeding. Differential gene prevalence during breastfeeding was tested using 
the samples in which the carrier species had >1% relative abundance. Testing was 
conducted using the test of equal or given proportions (prop.test function in R) 
and by comparing the prevalence (proportion of the samples for which the species 
in question harboured the gene according to the metagenomic data) of the gene in 
samples collected during breastfeeding with the samples collected after weaning. 
P values were adjusted for multiple testing by Benjamini-Hochberg method (p.ad- 
just function in R). All homologues together with their BLAST search metrics, 
prevalence in the metagenomic data and corresponding B. infantis HMO gene are 
reported in Supplementary Table 5. 

Bacterial growth assays. Bifidobacterium bifidum strain RJX-1201, 
Bifidobacterium breve RJX-1202 and Bifidobacterium longum RJX-1203 were 
streaked on brain heart infusion agar (BD) supplemented with 1% vitamin 
K/hemin solution (BD; sBH1]), and incubated for 48 h in a vinyl anaerobic chamber 
(Coy Laboratory Products) containing 5% CO, 5% Hz and 90% N2 and main- 
tained at 37°C. Cells were transferred to sBHI liquid medium (BHI broth, BD, 


supplemented as above) and grown for 24 h in anaerobic conditions. Cultures 
were washed twice with PBS and optical density at 600 nm (OD¢o9) was measured 
using a BioTek PowerWave 340 plate reader. OD¢o9 was normalized to 0.2 for all 
strains and 5 iil bacteria inoculum was added to a final volume of 200 1l containing 
10% sBHI and 125 mM carbon source (glucose, fucose, galactose or lactose) in 
a 96-well plate. OD¢o9 was measured in the plate reader every hour for 48 h with 
5 s of medium shaking before each measurement. All of the measurements were 
normalized to a medium-only blank. Experiment was repeated three times (n = 3) 
in triplicate and one representative experiment is shown. Error bars are s.d. of 
three technical replicates. 

Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 

Code availability. Code for Random Forest case-control comparisons and cohort 
wide MaAsLin association analyses in Supplementary Table 4 has been made pub- 
licly available at https://github.com/tvatanen/broad_teddy_microbiome_analyses. 
Other analysis software including quality control, taxonomic, and functional pro- 
filers is publicly available and referenced as appropriate. 


Data availability 

TEDDY microbiome 16S and whole-genome sequencing data that support 
the findings of this study are available in the NCBI database of Genotypes and 
Phenotypes (dbGaP) with the primary accession code phs001443.v1.p1, in accord- 
ance with the dbGaP controlled-access authorization process. Clinical metadata 
analysed during the current study are available in the NIDDK Central Repository 
at https://www.niddkrepository.org/studies/teddy. 
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Extended Data Fig. 1 | Heterogeneity in early taxonomic profiles. B. breve, B. bifidum and B. longum grown individually in low-nutrient 
a-d, Relative abundances of taxonomic groups highlighted by weighted medium (10% sBHI) supplemented with single carbon sources (glucose 
averages in Fig. 2a (arrows) shown separately (n = 10,913 samples). (g), galactose (h), fucose (i) and lactose (j)) or grown in 100% sBHI (k). 
e, f, Average longitudinal abundance of B. breve (e) and B. longum (f) As a negative control, growth curves of each strain grown in 10% BHI 
per clinical centre (n = 10,194 samples). The curves show LOESS fits without additional sugar are shown in black for each condition. Data are 
for the relative abundances, and shaded area shows 95% confidence representative of three independent experiments and are presented as the 
interval for each fit, as implemented in geom_smooth function in mean and s.d. of triplicate assessments. 


ggplot2 R package. g-k, Growth curves of human infant isolates of 
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Extended Data Fig. 3 | Accrual of microbial alpha diversity. a, Shannon's 
diversity of the taxonomic profiles of the gut microbial communities 

(n= 10,913 samples) with respect to the age at the sample collection. 

The curve shows the generalized additive model (GAM) fit for the 

data, and the shaded area shows the 95% confidence interval for each 

fit, as implemented in geom_smooth function in ggplot2 R package. 

b, Shannon's diversity for the samples in the IA case-control cohort 
(n=7,051) with respect to the time to the appearance of first autoantibody 
(seroconversion). The curves show LOESS fits for cases and controls 


Time to T1D diagnosis (days) 


separately, and the shaded area shows 95% confidence intervals for each 
fit. c, Shannon's diversity for the samples in the T1D case-control cohort 
(n= 3,309) with respect to the time to T1D diagnosis. The curves and 
shaded areas are as in b. d, As in c, but only for data (n = 983 samples) for 
subjects in Finland. No difference between cases and controls. e, As in c, 
but only for data (n = 142 samples, n = 6 subjects) for subjects in Georgia, 
USA. Cases show a drop in alpha diversity before the diagnosis of T1D 
(linear mixed-effects model, P= 0.0033). 
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were given between the two samples. For each notched box plot, the box were detected between the antibiotic types. No antibiotics, n = 7,130; 
denote the interquartile range (IQR), the horizontal line denotes the amoxicillin, n = 268; penicillin, n = 90; cephalosporin, n = 51; macrolide, 
median, and the notch denotes the approximation for the 95% confidence n= 60; other, n= 101. f, Decreases in relative abundance of bacteria over 
interval (notch width = 1.58 x IQR/n®®, in which n is the number of antibiotic courses. Bacteria for which the bootstrapped 95% confidence 
samples per box plot). Compare to Fig. 2c. b, Influence of antibiotic interval of the fold change does not overlap zero are shown. Fold change 
courses on microbial diversity. Notched box plots denote the increase was measured between consecutive samples with an antibiotic course 
(difference) in diversity between two consecutive stool samples (<50 between them, given that the species in question was present in the first of 
days apart) stratified by antibiotic administration between the samples. the two samples. Sample size per species (m) indicate the number of sample 
Data show no difference between the groups (antibiotics versus no pairs in which the species in question was present in the sample preceding 
antibiotics). c, Influence of antibiotics courses on microbial diversity by the antibiotic treatment. Bars denote bootstrapped mean log;(fold 
antibiotic type; data from b stratified into one-year time windows (x axis) change) (that is, decrease), and error bars denote s.d. (n = 1,000 bootstrap 
and antibiotic types. No significant differences were detected between samples). 


the antibiotic types. d, e, Influence of antibiotic courses on microbial 
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Extended Data Fig. 5 | Dynamics of species-specific microbial 
functional potential during early gut development. a, b, Stability of 
microbial pathways (n = 10,580 samples) measured by Bray—Curtis 
dissimilarity (a) and the Jaccard index (b) and stratified into three 
groups: within subject, within clinical centre, and across clinical centres. 
Although the baseline level of functional similarity is significantly 
greater than that of taxa (see Fig. 2b), functional states and development 
trajectories also both retain a level of personalization. The stability of 
the functional profiles was evaluated in three-month time windows, 
over two-month increments. Lines show the median per time window, 
and shaded area denotes the 99% confidence interval estimated using 
binomial distribution. c, d, Proportion of metagenomic gene abundance 
with functional annotation through Gene Ontology (c) and MetaCyc 
(d) databases. The metagenomic reads were divided into the following 


Age (years) 


categories: reads that could be mapped to genes with functional 
assignment in the database in question (annotated), and reads with no 
annotation but alignment to species pangenomes or UniProt proteins 
(unannotated). The proportion of the unknown genes (unmapped) 

was estimated using the number of reads with unknown origin. e, The 
proportion of unmapped reads, reflecting the relative abundances of reads 
not mappable to any microbial pangenomes in the available reference set 
or to UniProt. An increasing trend of unmapped reads with respect to 

the age at sample collection continued through approximately two years 
of age. f, The proportion of reads with confident functional annotation 

in MetaCyc within the genes that mapped to species pangenomes or 
UniProt proteins. The data again showed an increasing longitudinal trend, 
implicating a deficit of functional and biochemical annotations within 
microorganisms that are abundant during the first year of life. 
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Extended Data Fig. 6 | Differences between cases and controls. a, The 
gut microbiome functional (left) and taxonomic (right) profiles were 
classified between cases and controls using leave-one-out cross-validation 
(n = 3,366 samples), in which one case-control pair was held-out in 

turn. Data show error rates for classifying these held-out samples per 

fold (a data point per fold, n = 100 folds). This suggests weak but better- 
than-random classification between cases and controls. Notched box 
plots are as in Extended Data Fig. 4. b, Average longitudinal abundance 
of Ruminococcus gnavus in Finland (n = 2,630 samples) stratified by the 
number of observed persistent autoantibodies (AABs); no autoantibodies 
(that is, healthy control), a single autoantibody, or multiple (two or 

more) autoantibodies. c, Average longitudinal abundance of Lactobacillus 
rhamnosus in IA cases and controls (n =7,017 samples). L. rhamnosus 

is more abundant in controls (q= 0.055). The curves in b and c show 
LOESS fit per group, and shaded areas show 95% confidence interval for 


Lactobacillus rhamnosus 


each fit, as implemented in geom_smooth function in ggplot2 R package. 
d, Abundance (left) and prevalence (right) of Lactobacillus reuteri and 

L. rhamnosus in the first stool sample of each individual (collected at 
approximately three months of age) in association with early probiotic 
supplementation. ‘No probiotic’ indicates no probiotics given before the 
first stool sample (n = 583); ‘later probiotic refers to probiotics given 
later than the first four weeks but before the first stool sample (n = 45); 
‘early probiotic refers to probiotics given during the first four weeks of 
life (n = 84). Numbers (n) per clinical centre are given in Extended Data 
Table 2. L. reuteri and L. rhamnosus were more abundant and prevalent 
in groups with probiotics supplementation. Visual jitter was added to 
make data equal to zero distinguishable, and boxes denote the IQR, when 
applicable. The shown P values were obtained by applying Fisher’s exact 
test (two-sided) to presence or absence count data (counting samples in 
which the species were present). 
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D6ZY10: Dihydrodipicolinate synthetase 

D6ZY13: No name 

B7GNP2: Mandelate racemase/muconate 

lactonizing enzyme, C-terminal domain protein 
F6C8A7: Oxidoreductase, short chain (064, 


dehydrogenase/reductase family protein 


Samples Samples 
0.9 0.8 0.7 0.5 0.6 3 2 1 0 
Gene prevalence Gene abundance 
log10 [( CPM / B. pseudocatenulatum rel. ab. ) + 1] 

Extended Data Fig. 7 | Contrasting HMO utilization genes in relative abundance of B. pseudocatenulatum species was greater than 10% 
B. pseudocatenulatum. The gene families involved in HMO utilization (n= 1,325 samples). Rows and columns were ordered by hierarchical 
and that show contrasting presence in B. pseudocatenulatum genomes clustering using complete linkage method. Compare to Fig. 4, which 
during breastfeeding (n = 321 samples) compared to after weaning shows similar data for B. longum. UniRef90 identifiers and gene names or 
(n= 1,004 samples). Columns represent stool samples in which the families are indicated on the left. 


© 2018 Springer Nature Limited. All rights reserved. 


LETTER 


Extended Data Table 1 | Summary of TEDDY microbiome cohort 


US, Colorado US, Georgia US, Washington Finland Germany Sweden 
T1D cases (samples) 14 (274) 3 (89) 8 (111) 34 (553) 13 (246) 29 (532) 
IA cases (samples) 39 (689) 17 (252) 25 (368) 70 (900) 21 (292) 95 (1,542) 
Healthy controls (samples) 61 (906) 22 (250) 36 (399) 119 (1,273) 40 (512) 137 (1,725) 
Sex 
Male / Female 61/53 19/23 51/18 117/106 30/44 152/109 
Ethnic background 
White, non-hispanic 86 (75.4%) 41 (97.6%) 56 (81.2%) N/A N/A N/A 


Mode of birth 
Caesarean section 


Probiotic supplementation 
Probiotics during first 4 weeks 
Probiotics during follow-up 


Breastfeeding 

Median duration (days) 

duration, 25 percentile 

duration, 75 percentile 

Number of subjects never breastfed 


Maternal characteristics 

Maternal T1D 

Maternal T2D 

Gestational diabetes 

Antibiotics during pregnancy 
Metformin during pregnancy 
Glyburide during pregnancy 
Antihypertensives during pregnancy 
Insulin during pregnancy 


41 (36.0%) 


0 
22 (19.3%) 


7 (6.1%) 
2 (1.8%) 
5 (4.4%) 
21 (18.4%) 
1 (0.9%) 
2 (1.8%) 
4 (3.5%) 
9 (7.9%) 


22 (52.4%) 


2 (4.8%) 
13 (31.0%) 


301 
145 
365 
3 


0 
0 

5 (11.9%) 
10 (23.8%) 
0 

2 (4.8%) 

3 (7.1%) 

0 


25 (36.2%) 


0 
9 (13.0%) 


335 
171 
440 
1 


3 (4.3%) 
0 

5 (7.2%) 
5 (7.2%) 
0 

2 (2.9%) 
4 (5.8%) 
3 (4.3%) 


42 (18.8%) 


67 (30.0%) 
162 (72.6%) 


289 
152 
385 
0 


14 (6.3%) 
0 

32 (14.3%) 
40 (17.9%) 
1 (0.4%) 

0 


5 (2.2%) 
23 (10.3%) 


23 (31.1%) 


7 (9.5%) 
33 (44.6%) 


278 
140 
367 
0 


18 (24.3%) 
0 

3 (4.1%) 
13 (17.6%) 
0 

0 

3 (4.1%) 
19 (25.7%) 


46 (17.6%) 


7 (2.7%) 

0 

6 (2.3%) 
29 (11.1%) 
0 

0 

0 

8 (3.1%) 


Data on subjects’ ethnic background were not systematically collected in European clinical centres but these study populations were predominantly white, non-Hispanic. Reported antihypertensive 
drugs were atenolol (n= 2), bisoprolol (n= 1), labetalol (n =6), methyldopa (n= 1), methyldopa plus methyldopate (n = 3), metoprolol (n= 4) and nifedipine (n =5). No use of angiotensin-converting 
enzyme (ACE) inhibitors was reported. Numbers indicate the number of subjects (n) if not specified otherwise. 
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Extended Data Table 2 | Antibiotics and probiotics 
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US, Colorado US, Georgia US, Washington Germany Sweden 
Subjects with abx prescriptions 93 (81.6%) 37 (88.1%) 54 (78.3%) 56 (75.7%) 192 (73.6%) 
Median number of abx per subject (25th and 75th percentile) 2 (1-6) 5 (2-9) 2 (1-4) 2 (0-5) 2 (0-4) 
Number of abx by type (prescriptions per subject) 
Amoxicillin 242 (2.12) 147 (3.50) 104 (1.51) 45 (0.61) 134 (0.51) 
Cephalosporins 87 (0.76) 65 (1.55) 31 (0.45) 51 (0.69) 23 (0.09) 
Macrolide 54 (0.47) 35 (0.83) 47 (0.68) 33 (0.45) 23 (0.09) 
Penicillin 6 (0.05) 2 (0.05) 3 (0.04) 13(0.18) 412 (1.58) 
Other 76 (0.67) 80 (1.90) 33 (0.48) 77 (1.04) 154 (0.59) 
Total 465 (4.08) 329 (7.83) 218 (3.16) 219 (2.96) 746 (2.86) 
Probiotic use in early life 
Early probiotic 0 (0.0%) 1 (2.9%) 0 (0.0%) 7 (10.0%) 13 (5.6%) 
Later probiotic 1 (0.9%) 1 (2.9%) 2 (3.3%) 8 (11.4%) 17 (7.3%) 


No probiotic 


109 (99.1%)  32(94.1%) 59 (96.7%) 


55 (78.6%) 


202 (87.1%) 


Top, 3,678 antibiotic prescriptions in the TEDDY microbiome study population by clinical centre. Bottom, early probiotic supplementation in TEDDY clinical centres. Probiotic use was stratified into 
three categories: probiotics during the first 4 weeks of life (early probiotic); probiotics before the first stool sample (roughly at three months) but not the first 4 weeks (later probiotic); and no probiotics 
before the first stool sample (no probiotic). Data for probiotics are presented as n (percentage). 
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OTX2 restricts entry to the mouse germline 


Jingchao Zhang'*, Man Zhang!**, Dario Acampora’, Matus Vojtek!, Detian Yuan, Antonio Simeone? & Ian Chambers!* 


The successful segregation of germ cells from somatic lineages is 
vital for sexual reproduction and species survival. In the mouse, 
primordial germ cells (PGCs), precursors of all germ cells, are 
induced from the post-implantation epiblast!. Induction requires 
BMP4 signalling to prospective PGCs? and the intrinsic action of 
PGC transcription factors**. However, the molecular mechanisms 
that connect BMP4 to induction of the PGC transcription factors 
that are responsible for segregating PGCs from somatic lineages 
are unknown. Here we show that the transcription factor OTX2 
is a key regulator of these processes. Downregulation of Otx2 
precedes the initiation of the PGC programme both in vitro and 
in vivo. Deletion of Otx2 in vitro markedly increases the efficiency 
of PGC-like cell differentiation and prolongs the period of PGC 
competence. In the absence of Otx2 activity, differentiation of 
PGC-like cells becomes independent of the otherwise essential 
cytokine signals, with germline entry initiating even in the absence 
of the PGC transcription factor BLIMP1. Deletion of Otx2 in 
vivo increases PGC numbers. These data demonstrate that OT X2 
functions repressively upstream of PGC transcription factors, acting 
as a roadblock to limit entry of epiblast cells to the germline to a 
small window in space and time, thereby ensuring correct numerical 
segregation of germline cells from the soma. 

Different species form their germ cells by either of two general meth- 
ods: segregation of preformed germplasm, or induction by signalling”®. 
In mammals, germ cell precursors arise by induction? !!. In the mouse, 
competence to initiate germ cell development is restricted to a few 
cells within the E5.5-E6.25 epiblast!. BMP4 from the extraembryonic 
ectoderm acts on these competent cells to specify germ cell identity”. 
Specification also requires transcription factors, notably BLIMP1, AP2y 
and PRDM14*-°. However, the molecular mechanisms that connect the 
exposure of competent cells to BMP4 to the activation of PGC trans- 
cription factors are obscured by limited access to the peri-implantation 
embryo. Recently, a system for differentiation of PGC-like cells 
(PGCLCs) from embryonic stem cells (ESCs) via germline competent 
epiblast-like cells (EpiLCs)'” has opened up investigation of molecular 
events segregating germline and soma. 

During the ESC to EpiLC transition, the transcription factor OTX2 
becomes expressed and redirects binding of OCT4 to genomic regu- 
latory elements!*4. OTX2 was previously characterized as a regulator 
of anterior patterning'>!°, Previous work has demonstrated that OTX2 
and NANOG have antagonistic functions in ESCs!”"®. A positive role 
for NANOG in PGCLC differentiation has now also been added to 
the known requirements for Blimp 1, Prdm14 and Tfap2c (also known 
as Ap27)'®-?!. We therefore assessed expression of the corresponding 
mRNAs following addition of PGCLC-inducing cytokines to EpiLCs 
(Fig. 1a, b). Blimp1, Prdm14 and AP2y mRNAs did not change during 
the first 12 h. A modest increase in Ap2 mRNA at 24 h preceded 
more pronounced increases in all three mRNAs by 48 h (Fig. 1b). By 
contrast, Otx2 mRNA dropped to around 20% of the EpiLC level at 
24 h (Fig. 1b). Immunofluorescence analysis indicated that the pro- 
portion of cells expressing OTX2 protein decreased at 24 h, with 
almost no OT X2-expressing cells detected at 48 h (Fig. 1c, Extended 
Data Fig. 2a, b). Cultures in which PGCLC cytokines were omitted 


lost OTX2-expressing cells more slowly (Extended Data Fig. 2a, b). 
Moreover, while Otx2 mRNA declines upon FGF/Activin withdrawal, 
the kinetics of suppression are enhanced by PGCLC cytokine addition 
(Extended Data Fig. 2d). This suggests that PGCLC cytokines directly 
repress Otx2 transcription, a notion supported by the prompt decline in 
Otx2 pre-mRNA upon switching EpiLCs into PGCLC media (Extended 
Data Fig. 2e). BLIMP1 and AP2‘ proteins were initially detectable at 
24h, but only in cultures treated with cytokines (Extended Data Fig. 2a, b) 
and only in cells with reduced OTX2 (Fig. 1c, d, Extended Data Fig. 2c). 
These results suggest that before the PGC gene regulatory network 
(GRN) becomes activated, the transcriptional circuitry of the forma- 
tive pluripotent’, germline competent”? state characterized by OTX2 
expression’? becomes extinguished. 

The reciprocal relationship of OTX2 with BLIMP1 and AP27 dur- 
ing PGCLC induction prompted determination of whether a similar 
spatio-temporal relationship between changes in expression of OTX2 
and PGC transcription factors held in vivo. Whole-mount immuno- 
fluorescence of pre-streak stage embryos indicated that all epiblast cells 
express both OCT4 and OTX2 (Fig. le). At early- to mid-streak, OT X2 
remains widely expressed in the epiblast except for cells showing incipient 
FRAGILIS™ and BLIMP1? expression (Fig. 1f, h). By late-streak- to early- 
bud-stage, BLIMP] is clearly detectable within the FRAGILIS field in cells 
lacking OTX2 (Fig. 1, i). These results indicate that OTX2 is expressed 
ubiquitously in pre-streak epiblast cells but is specifically downregulated 
in the prospective PGC population before BLIMP1 expression. 

Cytokine addition is required for PGCLC differentiation'”. To assess 
when, cells were treated for one, two or six days with cytokines and 
analysed by FACS for surface expression of SSEA1 and CD61, which 
together act as a marker for PGCLCs’”. Cytokine treatment for the 
first day induces around half of the potentially responsive population 
to express both CD61 and SSEA1 (Extended Data Fig. 2f). Two or six 
days of cytokine treatment were equally effective at inducing CD61 and 
SSEA1 (Extended Data Fig. 2f). This suggests that cytokine exposure 
reaches maximum efficacy at two days, the time required to reduce 
OTX2 to minimal levels and initiate PGCLC transcription factor 
expression (Fig. 1b, d, Extended Data Fig. 2a-c). 

To directly assess whether OTX2 downregulation influences entry 
of pluripotent cells into the germline, Otx2-null cells were examined. 
A transgenic Oct4APE::GFP reporter activated upon germline entry” 
was added to Otx2"/- and Otx2~/— ESCs!” (Extended Data Fig. 1). 
Compared to Otx2 heterozygotes, Otx2~'~ cell populations showed 
widespread activation of Oct4APE, with essentially all cells activat- 
ing GFP (Fig. 2a). Furthermore, the SSEA1*CD61* cell number is 
increased 5-10-fold in Otx2~'~ versus Otx2t’* cells (Fig. 2b). This 
was also the case in independently generated Otx2~/~ ESCs (Extended 
Data Fig. 3b) and in additional independent Otx2~/~ ESCs generated 
using CRISPR/Cas9 (Extended Data Fig. 1, 3c). Three new ESC lines 
lacking OT X2 protein (Extended Data Fig. 3d) showed enhanced CD61 
and SSEA1 expression during PGCLC differentiation (Extended Data 
Fig. 3e). These results confirm that a lack of OTX2 promotes germline 
differentiation. 

To investigate at which stage of differentiation OTX2 influences 
germline entry, an Otx2-ER™ transgene (enabling tamoxifen-induced 
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Fig. 1 | Otx2 expression is downregulated before expression of 

PGC transcription factors. a, Scheme for PGCLC differentiation. 

b, Top, scheme illustrating the time-points (hours) during PGCLC 
differentiation when mRNAs were analysed. Bottom, real-time PCR 
(RT-PCR) of Otx2 and PGC transcription factors in wild-type E14Tg2a 
ESCs. Expression levels are normalized to TBP; values are means + s.d.; 
n= 3 biologically independent replicates. c, Single-cell quantification of 
immunofluorescence for OTX2 and AP27 in cytospin preparations of 
EpiLCs and cell aggregates at day 1 and day 2 of PGCLC induction. Two 
biologically independent replicates were performed. d, Whole-mount 
immunofluorescence of E14Tg2a aggregates after one day of PGCLC 
differentiation; n = 3; scale bars, 501m (top), 10,1m (bottom). 

e-g. Representative confocal images of whole mount staining of embryos 
at pre-streak (e, n= 4), early streak (f, n = 3) and late streak (g, n=3) 
stages; scale bars, 401m (e), 100 1m (f, g). h, i, Magnified image of the 
regions highlighted in f and g, respectively. OT X2-negative cells expressing 
BLIMP1 and FRAGILIS are outlined (g, h); scale bar, 201m. 
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re-localization of OTX2) was introduced into Otx2!#°“/CFP ESCs (Fig. 2c, 
Extended Data Fig. 4a). Tamoxifen treatment for the first two days 
suppressed emergence of SSEA1*CD61* cells (Fig. 2d). These results 
establish that enforcing OTX2 activity at a time when cells are compe- 
tent to enter the germline!” and when cytokines act to decrease endoge- 
nous Otx2 expression, is sufficient to block cytokine-mediated PGCLC 
differentiation. 

To examine the mechanism by which OTX2 inhibits PGCLC dif- 
ferentiation, mRNAs were analysed. Expression of PGCLC transcrip- 
tion factor mRNAs occurred sooner and to a greater extent without 
OTX2 (Fig. 2e). In contrast, enforcing OTX2 activity inhibited induc- 
tion of PGCLC transcription factor mRNAs (Fig. 2e). NANOG also 
directs PGCLC differentiation”®”®. Consistent with this, endogenous 
Nanog mRNA was induced precociously in Otx2~'~ cells (Fig. 2e). In 
addition, although Fgf5, Foxd3 and Oct6 mRNAs were not induced 
without OTX2 (Extended Data Fig. 3f), their expression was increased 
above wild-type levels by OTX2 induction (Extended Data Fig. 3f). 
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These analyses suggest that OT X2 acts at the juncture between somatic 
and germline differentiation and inhibits PGCLC differentiation by 
preventing PGC transcription expression. 

A previous report has provided evidence for the involvement of 
T (Brachyury) in PGCLC induction”’ by showing that BMP4 induced 
T expression via endogenous Wnt. We also found that T is activated 
robustly only when BMP4 is present (Extended Data Fig. 4b) with 
T and other somatic markers induced during PGCLC differentia- 
tion (Extended Data Fig. 4c). In vivo, BMP4 induces epiblast cells to 
secrete Wnt?®. Therefore, to assess whether Wnt acts as an interme- 
diary between BMP4 and activation of T and other somatic mark- 
ers, Wnt signalling was mimicked by adding CHIR99021 to basal 
media (Extended Data Fig. 4d). T, Hoxal and Hoxb1 mRNAs were 
induced by CHIR99021 but Otx2 mRNA was repressed; effects that 
were reversed by addition of the Wnt antagonist XAV939 (Extended 
data Fig. 4d). Therefore, Otx2 downregulation by BMP4 may occur 
via Wnt signalling. To further assess this, XAV939 was added during 
PGCLC differentiation. XAV939 did not affect Otx2 mRNA for 9 h, 
but dampened further reduction (Extended Data Fig. 4e). Moreover, 
XAV939 diminished induction of Blimp1 and Prdm14 mRNAs seen 
after 24 h (Extended Data Fig. 4e). This is consistent with a model in 
which BMP induction of Wnt enforces timely repression of Otx2 and 
full induction of Blimp1 and Prdm14. Finally, Otx2~'~ cells did not 
activate T mRNA (Extended Data Fig. 4c) or protein (Extended Data 
Fig. 4f). Therefore, activation of T is dispensable for PGC induction, at 
least when OT X2 is absent. These observations suggest that the effects 
of Wnt signalling during PGC differentiation could be attributed to 
OTX2 downregulation. 

To assess whether OTX2 can interfere with the function of an estab- 
lished PGCLC GRN, OTX2 activity was restored at day 2, once the 
PGC GRN was already activated (Fig. 2e). This produced a similar 
proportion of SSEA1*CD61* cells as cultures receiving no tamoxifen 
(Extended Data Fig. 4g, h). Therefore, OT X2 does not impair the func- 
tion of an established PGC network, but rather restricts the efficiency 
with which EpiLCs enter the germline. 

PGCLLC induction is considered to strictly require cytokine addi- 
tion’. Consistent with this, Oct4APE::GFP was not expressed by 
aggregates of Otx2!”~ cells cultured in the absence of cytokines 
(Fig. 3a). However, in Otx2—'~ cells, cytokines were not essential for 
OctAPE reporter activation (Fig. 3a), CD61 and SSEA1 surface expres- 
sion (Fig. 3b, Extended Data Fig. 5a, b) or PGC transcription factor 
expression (Extended Data Fig. 5c, d). Indeed, mRNA profiling indi- 
cated that Otx2t/+ and Otx2~'~ EpiLCs were transcriptionally similar 
and that following differentiation, Otx2~/~ cells resembled wild-type 
PGCLCs, irrespective of their exposure to cytokines (Extended Data 
Fig. 6a). Principal component and ternary analysis confirmed these 
assessments (Fig. 3c, Extended Data Fig. 6b). These results indicate that 
without OTX2, germline entry does not require cytokines. 

BLIMP1 is essential for wild-type cells to access the germline**. 
To determine whether Otx2~/~ cells retained this dependency, both 
Blimp1 alleles were deleted from ESCs of distinct Otx2 genotypes 
using CRISPR-Cas9 (Extended Data Figs. 1, 7a-c). PGCLC differen- 
tiation confirmed a BLIMP1 requirement for germline entry of OT X2- 
expressing cells (Fig. 3d, Extended Data Fig. 7d). However, deletion of 
Blimp1 from Otx2~'~ cells did not affect the ability of Otx2~~ cells to 
induce CD61 and SSEA1 (Fig. 3d, Extended Data Fig. 7d). Although 
deletion of Blimp1 from wild-type cells increased expression of 
somatic transcripts during PGCLC differentiation, this did not occur 
in Otx2~'"Blimp1~'~ PGCLCs (Extended Data Fig. 8a). Nor did dele- 
tion of Blimp1 impair the enhanced ability of Otx2~/~ cells to activate 
expression of Prdm14, Ap2y, Nanog or Oct4 mRNAs (Extended Data 
Fig. 8b) or DAZL protein (Extended Data Fig. 8c). During differenti- 
ation, PGCLCs that express OCT4 have higher H3K27me3 and lower 
H3K9me2 than OCT4-low cells. These relationships were maintained 
in Otx2~'~ and Otx2~/~ Blimp1~'~ PGCLCs (Extended Data Fig. 8d). 
Furthermore, without PGCLC cytokines, CD61/SSEA1 expression 
and Oct4APE reporter activation were unaffected by Blimp1 deletion 
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Fig. 2 | Otx2—'~ EpiLCs have an enhanced propensity to differentiate 
into PGCLCs. a, Representative morphologies and Oct4APE::GFP 
expression in aggregates at day 4 of PGCLC differentiation in the 
presence of cytokines. Percentages indicate GFP-positive cells; n = 9; 

scale bar, 200 1m. b, E14Tg2a and Otx2~'~ cells were assessed by flow 
cytometry for surface expression of SSEA1 and CD61 at days 6 of PGCLC 
differentiation; n = 12. c, Diagram of the tamoxifen-inducible Otx2 cell 
line, carrying an Otx2-ER” fusion protein transgene and replacements 

of endogenous Ofx2 alleles by GFP or LacZ. d, Otx2!#°7/G?.:Otx2ER™ 
cells were assessed by flow cytometry for surface expression of SSEA1 and 
CD61 at day 6 of PGCLC differentiation, either without tamoxifen or with 
tamoxifen from days 0-2; n = 4 biologically independent replicates. 

e, RT-PCR of PGC transcription factors. Expression levels are normalized 
to TBP; values are means + s.d.; n = 3 biologically independent replicates. 


in multiple Otx2~'~ cell lines (Fig. 3e, Extended Data Fig. 7e, f). 
Nevertheless, at day 6 of PGCLC differentiation, Otx2~! ~Blimp1 =I 
cells were unable to adopt the mature PGCLC transcriptome observed 
in Otx2*!* or Otx2-' Blimp1*'* cells (Fig. 3c, Extended Data Fig. 6). 
These results indicate that without OTX2, BLIMP1 is not required for 
phenotypic aspects of germline entry, but is required for a fully mature 
PGC phenotype. 

Although EpiLCs respond to cytokine induction by PGCLC dif- 
ferentiation, this is not maintained upon continued passaging in 
medium containing Activin/FGF”, suggesting that in these conditions, 
cells lose germline competence. To assess whether OT X2 affects the 
period during which pluripotent cells remain competent for germline 
entry, EpiLCs were passaged in EpiLC medium for a further two days 
(Fig. 4a). At this point the Oct4APE::GFP reporter is inactive in both 
Otx2”- and Otx2-'~ cells (Extended Data Fig. 9a). Cells were then 
transferred to PGCLC differentiation medium for 6 days. Notably, 
Oct4APE::GFP was reactivated robustly in Otx2~'~ but not Otx2!”— 
cells (Fig. 4b). Moreover, whereas all cell lines expressed Blimp1 and 
Ap2\ mRNAs, CD61/SSEA1 surface expression and Prdm14 and Nanog 
mRNA expression were observed only in Otx2~/~ and not in Otx2t/t 
or Otx2!”~ cells (Extended Data Fig. 9b-d). This indicates that in the 
absence of OTX2 the period of competence to enter the germline is 
extended. 

To determine whether Otx2~/~ cells exhibit an increased propen- 
sity to enter the germline in vivo, Otx2'/* or Otx2~'~ ESCs constitu- 
tively expressing GFP were compared in chimaeras following morula 
aggregation (Fig. 4c). Otx2t/* and Otx2~'~ cells had a comparable 
capacity to produce chimaeras (Extended Data Fig. 9e, g). However, 
an enhanced proportion of Otx2~/~ cells expressed BLIMP1 or SOX2 
(Fig. 4d, Extended Data Fig. 9f), indicating that enhanced germline 
entry is a cell autonomous property of Otx2~/~ cells. 
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Fig. 3 | Otx2-null cells can access the germline independently 

of cytokines and BLIMP1. a, Representative morphologies and 
Oct4APE::GFP expression in aggregates at day 4 of PGCLC differentiation 
without cytokines. Percentages indicate GFP-positive cells; (n = 7); scale 
bar, 200 pm. b, E14Tg2a and Otx2~'~ cells were assessed by flow cytometry 
for surface expression of SSEA1 and CD61 at day 6 of differentiation in the 
absence of PGCLC cytokines; (n = 9). c, Ternary plot analysis (microarray 
data from three biologically independent replicates under seven different 
conditions) comparing transcriptomes of EpiLCs (top) or day 6 PGCLCs 
(bottom). Circles represent probes, with colour indicating the probe 
density. Differentiations performed in the presence or absence of cytokines 
are indicated (+/— cyt). WT, E14Tg2a; O-/”," Otx2!a2/GFP; Q-/- Bo, 
Otx2!**2/GFP Blimp1~’~. d, Cells of the indicated genotypes were 

assessed by flow cytometry for surface expression of SSEA1 and CD61 at 
day 6 of PGCLC differentiation; n= 2 for 1 clone of each genotype. Two 
further clones of each genotype are shown in Extended Data Fig. 7. 

e, Cytokine-free PGCLC differentiation. Representative morphologies 

and Oct4APE::GFP expression of aggregates at day 6. Percentages indicate 
GFP-positive cells; n =2 for one clone of each genotype; scale bar, 200 jum. 


To determine whether Otx2~/~ embryos also showed enhanced PGC 
numbers, Otx2*/— mice!> were inter-crossed and Otx27!— embryos ana- 
lysed at E7.5. These embryos show strong developmental defects!>'® 
but also showed increased PGC numbers (Fig. 4e, Extended Data 
Fig. 10a, b), confirming that in vivo, OTX2 acts as a negative regulator 
of the PGC programme. 

Previous studies have shown that, in mice, competence to enter the 
germline exists transiently in embryos from E5.5—-E6.25'. In wild-type 
cells, germline entry requires BMP4 signalling from the extraembry- 
onic ectoderm? and is critically dependent on the downstream action 
of BLIMP1?. Our work identifies OT X2 as an intermediary fulcrum 
in these processes (Fig. 4f). During the period of PGC competence, 
BMP4 represses Otx2 expression, partly by endogenous Wnt activa- 
tion (Fig. 4g). This reduction in OTX2 is necessary for expression of 
the PGC transcription factors BLIMP1, PRDM14, AP2 and NANOG 
as enforcing OTX2 activity prevents their expression. Moreover, the 
rate of Otx2 decline appears important as without cytokines, germline 
entry is diminished. We propose that without cytokines, the window 
for germline entry closes before OTX2 is reduced below a threshold 
necessary for PGC transcription factor expression. Furthermore, in 
the absence of OTX2, PGC transcription factor expression does not 
require BMP4, indicating that BMP4 functions by repressing Otx2. 
Otx2-'~ cells also exhibit an extended competence period, suggesting 
that OT X2 starts a process that defines the extent of the competence 
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period. Also, as Otx2~/~ cells can initiate PGC differentiation without 
BLIMP], this suggests a reciprocal relationship between BLIMP1 and 
OTX2, in which BLIMP1 represses”)? ° and OTX2 activates somatic 
gene expression!’”. Supporting this, during PGCLC differentiation, 
Otx2~/~ cells do not activate mesendoderm genes!” (Extended Data 
Fig. 4c) that are otherwise repressed by BLIMP1*°. This may explain 
why some aspects of the PGCLC differentiation phenotype can be 
divorced from a BLIMP1 requirement in Otx2~/~ cells. This places 
OTX2 at a developmental crossroads where it acts to control excessive 
access to the germline (Fig. 4f). 

Finally, our findings are noteworthy in light of the hypothesis 
that the neural lineage is the default developmental pathway for 
vertebrate cells*!. Interestingly, neural induction requires inhibition 
of BMP signalling”. BMP is a known facilitator of germline entry” and 
is identified here as a key Otx2 repressor. The default neural induction 
hypothesis is based principally on studies in chicks and frogs, species 
in which PGCs are formed by germplasm segregation. Yet, induced 
germline segregation is considered the ancestral mechanism that pre- 
dates the recurrent evolution of germplasm”. The highly efficient entry 
of pluripotent cells into the germline in the absence of OTX2 reported 
here suggests that the germline may be the ancient default option that 
must be overcome in order to elaborate the ancillary structures of the 
soma. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized, and the investigators were not blinded to 
allocation during experiments and outcome assessment. All mouse studies were 
performed in accordance with UK Home Office regulations under project licence 
PPL 60/4435 and carried out in a Home Office-designated facility. 

Cell culture and differentiation. ESCs were routinely cultured in GMEMB sup- 
plemented with 100 U mI"! LIF and 10% FCS at 3-10 x 10° cells per cm? density 
on tissue culture flasks coated with 0.1% gelatin**. Cells were routinely tested for 
mycoplasma contamination. EpiLC differentiation and PGCLC induction were 
carried out according to protocol previously described!?°*4. 

For cytokine-free differentiation, EpiLCs were dissociated and resuspended at 
8 x 10 cells ml"! in GK15 medium (GMEMB supplemented with 15% KOSR). 
25 11 drops containing 2,000 cells were plated on the lids of tissue culture dishes 
and incubated over a reservoir of PBS for 2 days. Hanging drops were then collected 
and transferred to untreated culture dishes supplemented with GK15 medium and 
rotated at 72 r.p.m. Fresh medium was replenished every other day. CHIR (3 1M) 
and XAV939 (21M) were used to induce and repress Wnt signalling. 
Generation of knockout cell lines. For a summary of cell lines used in this report 
and a history of their derivation, see Extended Data Figure 1. 

Otx2-knockout (Otx2!*°4/GFP) and Otx2 conditional knockout (Otx2!4°2"; 
Rosa26“**'+) cell lines and targeting strategies have been described previously!”. 
To derive Otx2-knockout (Otx2!) lines from Otx2!#2", Otx2!#°2/ ESCs were 
treated with 1|1M tamoxifen for 24 h, plated at clonal density and after 6 days, 
individual colonies picked, expanded and genotypes verified with RT-PCR, western 
blot and immunostaining. 

For knockout cell lines derived using CRISPR/Cas9 technology, ESCs were 

transfected with vectors expressing gRNAs and eSpCas9-T2A-mCherry or eSp- 
Cas9-T2A-eGFP using Lipofectamine 3000 (Invitrogen, L3000015) and cultured 
for 24 h. After FACS sorting, single-cell clones were expanded and genotyped. 
Clones with predicted genotypes were further verified with qPCR or western blot 
and immunostaining. See Supplementary Table 1 for detailed oligonucleotide 
sequences of gRNAs, genotyping PCR. 
Immunohistochemistry. For whole-mount staining of embryos and PGCLC 
aggregates, embryos and PGCLC aggregates were fixed with 4% paraformaldehyde 
(PFA) in PBS (room temperature, 30 min). For Fragilis staining, embryos were 
washed three times in PBS/0.1%BSA and blocked overnight at 4°C in 3% donkey 
serum (Sigma)/1% BSA/PBS solution. For other antibodies, samples were washed 
three times with PBS/0.1% TritonX100 (PBST), permeabilized in 0.5% Triton 
X100/PBS solution for 15 min and incubated in 1M glycine in PBST for 20 min. 
After three washes, samples were blocked overnight at 4°C using 3% donkey 
serum, 1% BSA (Sigma) in PBST (blocking buffer). Samples were then incubated 
for 72 h with diluted primary antibodies, rinsed four times for 20 min each in 
PBST and incubated for 4 h at room temperature with diluted secondary antibody. 
Samples were then rinsed four times for 20 min each in PBST. DAPI (Molecular 
Probes, D1306) was used for nuclear staining (same for other staining). For imag- 
ing, embryos were treated with 10%, 25%, 50%, 97% thiodiethanol (Sigma, 166782) 
for 5 min in each gradient and were imaged using the TCS SP8 inverted confocal 
microscope (Leica). 

For staining of frozen sections, aggregations were fixed in 4% PFA (room 
temperature, 20 min). After washing in PBS, samples were embedded in Tissue- 
Freezing medium (Thermo Fisher scientific, 6502Y) and sectioned at a thickness of 
5\um. For antigen retrieval, sections were microwaved with highest power in 10 mM 
sodium citrate (PH, 6.0) for 2 min 30 s, twice. After antigen retrieval, samples 
were incubated in blocking buffer (room temperature, 1 h), then incubated with 
primary antibodies (4°C overnight), after being rinsed three times in PBST for 
10 min, incubated for 2 h at room temperature with diluted secondary antibody. 
Sections were then covered by cover slip in Fluoromount (SouthernBiotech, 0100-01) 
and imaged using the TCS SP8 inverted confocal microscope (Leica). 

For immunostaining of cells grown in monolayer, cells were washed once with 
PBS, fixed in 4% PFA (room temperature, 10 min), permeabilized in 0.3% Triton 
X100/PBS (room temperature, 20 min), incubated in blocking buffer (room 
temperature, 1 h), before addition of primary antibodies and incubation at 4°C 
overnight. Cells were washed in PBST (four times, 5 min) before incubation with 
diluted secondary antibodies (room temperature, 1 h). After washing four times 
(PBST), cells were imaged using the TCS SP8 inverted confocal microscope (Leica). 

For cytospin staining, cells or aggregates were dissociated into single cells. 
1 x 10° cells in 100,11 1% BSA/PBS buffer were added into sample holders, centri- 
fuged in the cytospin machine (5 min, 1200 r.p.m.). Microscope slides were then 
taken from the holders and cells circled with hydrophobic marker pen. Staining 
was then finished following the same protocol as for monolayer cells. 

For PGC cell counting in wild-type and Otx2-null embryos, embryos 
were isolated at E7.5, washed in PBS, fixed overnight in 4% PFA/PBS, dehydrated 
and paraffin embedded as previously reported!*. Embryos were sectioned in 


coronal-frontal or sagittal sections and processed for immunohistochemis- 
try (IHC) with antibodies against BLIMP1, AP2y (Santa Cruz Biotechnology, 
SC-53162) and Fragilis (R&D Systems, AF3377). All sections were analysed and 
those including PGCs captured for cell-counting analysis. 

If not specified, primary and secondary antibodies used are listed in 
Supplementary Table 2. 
Immunofluorescence quantification. Cytospin slides were stained at the same 
time and imaged using a Zeiss Observer microscope (Zeiss), Plan-Apo 20x NA, 
0.8 objective (Zeiss), a Hamamatsu ORCA-Flash4.0 V3 camera (Hamamatsu), a 
Colibri 7 (Zeiss) light source and the following filter cubes (name, excitation LED, 
beam splitter, band pass emission filter; 49 DAPI, 395, 480/40; 38 HE GFP, 495, 
525/50; 43 HE dsRed, 570, 605/70; Cy5, 660, 700/775). Images were analysed using 
CellProfiler software (version 2.2.0)*°. DAPI staining was first used to segment 
individual nuclei based on the diameter and intensity of the objects (25-90 pixel 
units and intensity threshold >0.05 respectively), then the intensity of OTX2 and 
AP2- of the segmented nuclei were measured and the mean intensity of each chan- 
nel reported. Over 8,000 segmented nuclei of each samples were analysed. The 
data were analysed in R and plots were generated using the ggplot2 package. The 
Otx2-knockout EpiLCs are negative for both OTX2 and AP2- staining and there- 
fore were used to set up the threshold for gating OTX2 and AP27 for all samples 
(negative gate > 99.5%). 
Flow cytometry. FACS analysis was performed as described”®. Cells grown in 
monolayer or embryoid bodies in suspension were dissociated into single cells 
with trypsin and neutralized in PBS/10%FCS. A maximum of 5 x 10° cells were 
collected by centrifugation and the pellet resuspended in 100,11 PBS/10%FCS sup- 
plemented with Alexa Fluor 647 anti-mouse/human CD15 (SSEA-1) (Biolegend, 
125608) and PE anti-mouse/rat CD61 (Biolegend, 104307) diluted 1/200 and 
1/500, respectively, and incubated (30 min, 4°C). Cells were washed twice in 1 ml 
PBS/10% FCS before analysis on a BD Fortessa 5 laser LSRII. Gate strategy is shown 
in Extended Data Fig. 3a. 
SDS-PAGE electrophoresis and immunoblotting. Immunoblot analysis was 
performed as described**. Briefly, protein samples from cell lysates, along with 
protein ladder (Novex, cat. LC5925) were loaded on 10% Bis-Tris Gels (Novex, 
cat. BG00102BOX) and electrophoresis performed at 200 V for 60-80 min. 
Proteins were then transferred onto Nitrocellulose membrane (Capitol Scientific, 
cat. 10401396), blocked in 10% milk (w/v) (room temperature, 1 h), followed by 
incubation with primary antibodies (4°C, overnight). After three washes in PBST, 
membranes were incubated with IRDye conjugated secondary antibody, followed 
by three washes in PBST before being visualized in LI-COR Odyssey Imaging 
Systems. Primary and secondary antibodies are listed in Supplementary Table 2. 
RNA analysis. Total RNA was extracted using either Trizol (Invitrogen, 15596026) 
or RNeasy micro (Qiagen, 74034) or mini kit (Qiagen, 74104) following the manu- 
facturer’s instructions, followed by DNase treatment (Qiagen, 157047207). Reverse 
transcription reactions were performed using Superscript First Strand Synthesis 
kit (Invitrogen, 1964419) and quantitative PCR performed using SYBR Green Kits 
(Takyon, UF-NSMT-B0701) on Roche LightCycler 480 with cDNA equivalent of 
25 ng total RNA per reaction. Values for each gene were normalized to expression 
of TATA-box Binding Protein (TBP) according to 2-4 formula?”. Oligonucleotide 
sequences are shown in Supplementary Table 3. 
Transcriptomic profiling and data analysis. Total RNAs were extracted using 
RNeasy Plus Micro Kit (Qiagen, cat.74034), following manufacturer's instructions. 
RNAs were labelled using Illumina TotalPrep RNA Amplification Kit (Illumina, 
cat. AMIL1791) following manufacturer's instructions. Briefly, 100 to around 
300 ng total RNA were reverse transcribed using oligo(dT) primers, followed by 
second-strand cDNA synthesis. The dsDNAs were then in vitro transcribed into 
biotin labelled complementary RNA (cRNA) at 37°C for 12 h. The cRNAs were 
purified and quality analysed using Agilent Bioanalyzer 2100. The hybridizations 
were performed by the Eurofins Genomics AROS in Denmark. The samples were 
hybridized on Illumina MouseWG-6 v2 BeadChip. A signal intensity (expression) 
value and a detection P value (whether the signal is above background) for each 
probe on the array was obtained using Illumina GenomeStudio software with 
standard settings. A total of 13,683 probes which had detection P value < 0.01 in 
at least three experiments were considered for further analysis. A variance-based 
filtering of gene expressions was performed using the genefilter package with 
variance cutoff of 0.75, leaving 3,421 probes for further statistical analysis. 
Unsupervised hierarchy clustering, generation of heatmap and PCA and ternary 
plot were carried out using R (https://www.R-project.org) using the following pack- 
ages: cluster, pheatmap, FactoMineR, ggtern and RColorBrewer. The microarray 
data has been deposited in the GEO database under accession number GSE116640. 
Embryo aggregation. E2.5 embryos were collected from superovulated F1 (CBA 
male cross with C57BL/6J female) females with M2 buffer (Sigma, M1767). After 
being cultured in KSOM (Millipore, MR-020P-5F) for 15 min, zonae pellucidae 
were removed by Tyrode's solution (Sigma, T1788) and embryos cultured in KSOM 
in aggregation plates prepared by aggregation needle (BLS, DN-10). Wild-type 
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E14Tg2a ESCs and two independent Otx2-knockout cell lines (labelled with Flag- 
GEP) were used for aggregation. Cells were trypsinized (37°C, 1 min), washed and 
re-suspended with GMEM/FCS medium. Each embryo was aggregated with 6-8- 
cell clumps and cultured in the incubator. The next day, good-quality blastocysts 
were picked and transferred to E0.5 CD1 recipient*®. 

Reporting summary. Further information on research design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability 
All the data sets generated or analysed during the current study are available from 
the corresponding author on reasonable request. 
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Extended Data Fig. 1 | Summary of cell lines used in this report. 
Otx2!#°Z/GFP and Otx2!#“ ESCs have been described previously!” 
Summarized below are further modifications to Otx2 or Blimp1, or 
transgene additions in the above or wild-type backgrounds. Further 
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schematic details illustrating the points of Cas9 modification of Otx2 
or Blimp1 and genotype verification of derived cell lines are shown in 
Extended Data Fig. 3 and Extended Data Fig. 7, respectively. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | Competence for germline entry is preceded by 
downregulation of OTX2 protein. a, Representative cytospin images 

of OTX2, BLIMP] and AP2 staining using E14T g2a aggregates after 

1 day or 2 days of PGCLC differentiation; n = 2; scale bar, 100 1m. 

b, Whole-mount immunofluorescence of E14Tg2a aggregates after 1 (D1) 
or 2 days (D2) of differentiation of EpiLCs in the presence or absence 

of cytokines. Representative images of OTX2 and BLIMP] are shown; 
n= 3; scale bar, 50 jum. c, Magnified image of the region highlighted in b; 
scale bar, 10,1m. d, Quantitative transcript analysis of Otx2 in E14Tg2a 
cultures with (n = 4) or without cytokines (n =7) at indicated time 
point. Schematic illustration is shown in Fig. 1b. Expression levels are 


normalized to TBP; values are means +s.d. e, Top, primers used for Otx2 
pre-mRNA transcript analysis are shown relative to the primary transcript 
structure. Bottom, quantitative transcript analysis of Otx2 pre-mRNA at 
the indicated times (minutes) after changing E14Tg2a EpiLCs into PGCLC 
medium. Expression levels are normalized to TBP and shown relative to 
expression at t= 0; values are means + s.d.; n = 3 biologically independent 
replicates. f, Assessing the temporal requirement of cytokine treatment for 
efficient PGCLC induction. Aggregates of E14Tg2a EpiLCs treated with 
cytokines for 1 (d0-d1), 2 (d0-d2) or 6 days (d0-d6) were assessed by flow 
cytometry for surface expression of SSEA1 and CD61 at day 6 of PGCLCs 
differentiation; n =3. 
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Extended Data Fig. 3 | Independent Otx2~/~ clones show enhanced 
PGCLC induction efficiency. a, The gating strategies for analysing 
PGCLCs by flow cytometry. Cells were first gated based on the FSC (size) 
and SSC (complexities) scatter plot, followed by selection for singlets 
based on linear correlations between FSC-area and FSC-height. Live cells 
were then gated based on exclusion of DAPI to indicate cell membrane 
integrity. Live cells were then analysed for SSEA1 and CD61. Cells stained 
for fluorescence minus one (FMO) were used to set gates; stained and 
non-stained cells are also shown. b, Otx2!@°" and Otx2 !*°2/" cells with the 
Oct4APE::GFP reporter (two independent clones each) were assessed 

by flow cytometry for surface expression of SSEA1 and CD61 at day 6 

of PGCLC differentiation. For clone 5 and clone 1, n= 2; for clone 11 


and clone 2, n= 9. c, Diagram showing the gRNA sequence (in red) and 
targeting strategy for generating Otx2-knockout cell lines. Red arrows 
represent genotyping primers used for screening clones. d, Immunoblot 
analysis of OTX2 protein expression in EpiLCs of E14Tg2a and three 
Otx2~/~ clones. Experiment preformed once. e, E14Tg2a and three 
independent Otx2~/~ clones generated by CRISPR/Cas9 were assessed 
by flow cytometry for surface expression of SSEA1 and CD61 at day 6 

of PGCLC differentiation. Two biologically independent experiments 
for clone cll, one for clone c17 and c19. f, Q-RT-PCR of epiblast 
markers during the time-course outlined in Fig. 1b. Expression levels are 
normalized to TBP; values are means + s.d.; n= 3 biologically independent 
replicates. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | OT X2 restricts PGC specification during 

the first two days of induction. a, OTX2 immunofluorescence of 
Otx2!#-2/GFP.-Ctx2ER™ ESCs before or after treatment with tamoxifen for 
20 min; n= 2 biologically independent experiments; scale bar, 20 jum. 

b, Quantitative transcript analysis of T (Brachyury) during the time- 
course outlined in Fig. 1b in basal GK15 medium supplemented with the 
indicated cytokines. Expression levels are normalized to TBP; values are 
means +s.d.; n =3 biologically independent replicates. c, Quantitative 
transcript analysis of T (Brachyury), Hoxal and Hoxb1 during the time- 
course outlined in Fig. 1b in indicated cell lines. Expression levels are 
normalized to TBP; values are means from two biologically independent 
replicates. d, Left, scheme illustrating the strategy for induction or 
repression of Wnt signalling. E14Tg2a EpiLCs were aggregated in 

the indicated media and transcripts analysed at 0, 9 and 18 h. Right, 
quantitative transcript analysis of T (Brachyury), Hoxal, Hoxb1 and 


Otx2 during the time-courses outlined on the left. Expression levels are 
normalized to TBP; values are means + s.d.; n = 3 biologically independent 
replicates. e, Quantitative transcript analysis of T (Brachyury), Otx2, 
Blimp1 and Prdm14 during E14Tg2a differentiation in three different 
media conditions (GK15, without cytokines; PGCLC, GK15 with 
cytokines; PGCLC +XAV, GK15 with cytokines and with XAV939) at the 
indicated time point. Expression levels are normalized to TBP; values are 
means + s.d.; n =3 biologically independent replicates. f, Whole-mount 
immunofluorescence analysis of AP2y and T (Brachyury) in E14Tg2a 

and Otx2!#2/GFP day 2 (D2) PGCLC aggregates; n = 2 biological replicates; 
scale bar, 50\1m. g, Scheme illustrating tamoxifen administration schemes. 
h, Otx2!*2/CFP--Otx2ER™ cells were assessed by flow cytometry for 

surface expression of SSEA1 and CD61 at day 6 of PGCLC differentiation 
following the tamoxifen treatment regime outlined (g); n = 2 biological 
replicates. 
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Extended Data Fig. 9 | OT X2 safeguards somatic lineages. 

a. Representative morphologies and Oct4APE::GFP expression of EpiSCs 
after one passage from EpiLCs (n= 3 for 1 clone of each genotype); scale 
bar; 200 um. b, c, Flow cytometry analysis for surface expression of SSEA1 
and CD61 at day 6 of PGCLC differentiation, initiated from EpiSCs after 
one passage from EpiLCs. One experiment for c5 and cl and 6 biologically 
independent replicates for C11 and C2 (b); n =6 biologically independent 
replicates (c). d, Quantitative transcript analysis of PGC transcription 
factors in the indicated cell lines. Expression levels are normalized to 
TBP; values are means + s.d.; n = 3 biologically independent replicates, 
each dot represents the value from one experiment. e, Comparison of 


the frequency of degree of chimaerism (top) and the germ cell numbers 
(bottom, centre lines and error bars represents means + s.d.) in E7.5 
chimaeric embryos formed using wild-type or Otx2-null ESCs. P value 
(two-tailed unpaired t-test, 0.95 confidence intervals) is indicated. High, 
>70%; moderate, 30-70%; low, <30%. f, Bright-field and representative 
images of E7.5 chimaeric embryos formed by wild-type host embryos and 
GFP-labelled Otx2t/*+ (n= 6), Otx2~!~ (n=9) or Otx2!**2/GFP (7 = 9) ESCs 
assessed for GFP and BLIMP1/SOX2 expression, with magnified images of 
the proximal posterior regions. The proportion of BLIMP1-positive cells 
expressing GFP in the embryos is indicated; scale bar, 100 1m (left), 201m. 
g, Summary of embryo aggregations. 
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Extended Data Fig. 10 | Otx2-null embryos exhibit increased number of 
PGCs. a, b, Frontal-coronal (a) and sagittal (b) sections of wild-type and 
Otx2~/~ E7.5 embryos stained with BLIMP1, AP27 and Fragilis to detect 
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PGCs. All sequential sections spanning the PGCLCs niche are shown; 
scale bar, 50\1m. The experiments were repeated in four wild-type and 
three Otx2-knockout embryos. 
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A metabolite-derived protein modification 
integrates glycolysis with KEAP1-NRF2 signalling 


Michael J. Bollong’*®, Gihoon Lee**°, John S. Coukos**, Hwayoung Yun", Claudio Zambaldo!, Jae Won Chang**, Emily N. Chin!, 
Insha Ahmad!, Arnab K. Chatterjee*, Luke L. Lairson)**, Peter G. Schultz** & Raymond E. Moellering?3* 


Mechanisms that integrate the metabolic state of a cell with 
regulatory pathways are necessary to maintain cellular homeostasis. 
Endogenous, intrinsically reactive metabolites can form functional, 
covalent modifications on proteins without the aid of enzymes!”, and 
regulate cellular functions such as metabolism? and transcription®. 
An important ‘sensor’ protein that captures specific metabolic 
information and transforms it into an appropriate response is 
KEAP1, which contains reactive cysteine residues that collectively 
act as an electrophile sensor tuned to respond to reactive species 
resulting from endogenous and xenobiotic molecules. Covalent 
modification of KEAP1 results in reduced ubiquitination and the 
accumulation of NRF2”*, which then initiates the transcription of 
cytoprotective genes at antioxidant-response element loci. Here we 
identify a small-molecule inhibitor of the glycolytic enzyme PGK1, 
and reveal a direct link between glycolysis and NRF2 signalling. 
Inhibition of PGK1 results in accumulation of the reactive 
metabolite methylglyoxal, which selectively modifies KEAP1 to 
form a methylimidazole crosslink between proximal cysteine and 
arginine residues (MICA). This posttranslational modification 
results in the dimerization of KEAP1, the accumulation of NRF2 


and activation of the NRF2 transcriptional program. These 
results demonstrate the existence of direct inter-pathway 
communication between glycolysis and the KEAP1-NRF2 
transcriptional axis, provide insight into the metabolic regulation 
of the cellular stress response, and suggest a therapeutic strategy 
for controlling the cytoprotective antioxidant response in several 
human diseases. 

In line with its role in responding to altered cellular conditions, 
numerous studies have linked deregulated KEAP1-NRF2 signalling 
in disease, including cancer’, neurodegenerative disorders’®, chronic 
inflammatory diseases'', diabetes!” and ageing'’. Efforts to target 
NRF? signalling therapeutically have largely focused on covalent small- 
molecule agonists of KEAP1, including the clinical candidate bardox- 
olone methyl (BARD)!“. To discover noncovalent modulators of the 
KEAP1-NRF2 signalling axis, as well as potentially new mechanisms 
of action for its regulation, we performed a cell-based, high-throughput 
phenotypic screen using a NRF2-dependent luciferase reporter 
(pTI-ARE-LUC) in IMR32 cells'®. From a library of diverse hetero- 
cyclic compounds, we identified a hit compound, CBR-470-0, that 
did not contain any obvious electrophilic groups, and induced the 
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Fig. 1 | CBR-470-series compounds activate NRF2 signalling in vitro 
and in vivo. a, Structure of CBR-470-1. b, NRF2 protein levels from 
IMR32 cells treated with the indicated concentrations of CBR-470-1 for 
4h (top) or 541M CBR-470-1 for the indicated time periods (bottom). 
Blots are representative of three independent experiments. c, Gene set 
enrichment analysis (GSEA) plot depicting the enrichment of a NRF2 
target gene set (‘Singh _NFE2L2_Targets’ in MSigDB) from IMR32 cells 
treated for 24 h with 541M CBR-470-1 (n= 3, P< 0.0001, nominal P value 
in GSEA). d, Heat map representation of the leading-edge subset of the 
most upregulated NRF2-regulated transcripts after CBR-470-1 treatment. 
Data are biologically independent samples. e, Relative Nqgol and Hmox1 


transcript levels in mouse skin tissue 24 h after the indicated oral doses of 
CBR-470-2 (n= 6, biologically independent samples). f, Quantification of 
wounded area by automated image analysis from animals of the indicated 
treatment groups at study end (day 10). g, Quantification of epidermal 
thickness from haematoxylin and eosin (H&E)-stained sections from the 
indicated groups at study end. h, Representative images of H&E-stained 
skin sections from animals euthanized at day 10 of the study. CBR-470-2, 
50 mg kg twice a day, orally. BARD, 3 mg kg’ twice a day, orally. UV, 
200 mJ cm~?. Data are mean and s.e.m., 1 = 8 animals. Statistical analyses 
are by one-way analysis of variance (ANOVA) with Dunnett's correction 
(e-g). 
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Fig. 2 | CBR-470-1-dependent inhibition of glycolysis activates NRF2 
signalling. a, Anti-biotin western blot analysis of IMR32 cells treated 
with CBR-470-PAP (101M) for 1 h and exposed to UV light to induce 
photocrosslinking (representative shown from n = 4 biological replicates). 
b, Transient transfection of shRNA constructs targeting PGK (sh10 

and sh47) in HEK293T cells activates the ARE-LUC reporter. PGK1 and 
6-actin protein levels shown from representative experiments (n =4 
biological replicates). c, Viral shRNA knockdown of PGK1 induces NQO1 
mRNA levels in IMR32 cells. PGK1 and tubulin protein levels are shown 
from representative experiments (n = 3). d, e, CBR-470-1 activation of 
ARE-LUC reporter in HEK293T cells with transient knockdown (d) or 
overexpression (e) of PGK1 demonstrates opposing effects on compound 
potency. PGK1, actin and tubulin protein levels are shown from 
representative experiments (n= 3). ARLU, change in relative light units. 
f, Heat map depiction of relative metabolite levels in IMR32 cells treated 


transcriptional activity of antioxidant-response elements (AREs) 
to a similar magnitude to the previously reported NRF2 activators 
tert-butylhydroquinone (TBHQ) and AI-1 (Extended Data Fig. la-c). 
Structure-activity relationship analysis around the cyclic sul- 
fone scaffold led to the identification of CBR-470-1 (Fig. 1a), an 
isobutylamine-substituted analogue that was not reactive in glutathione 
assays and had a half-maximum effective concentration (ECs9) value 
of approximately 1 1M in the cellular ARE-LUC assay (Extended 
Data Fig. 1d, e). CBR-470-1 treatment resulted in a dose- and time- 
dependent accumulation of NRF2 protein in IMR32 cells (Fig. 1b), 
and increased both mRNA and protein levels of the NRF2-responsive 
genes NQO1 and HMOX1 (Extended Data Fig. 1f, g; Extended Data 
Table 1). Expression profiling of IMR32 cells exposed to compound 
for 24 h revealed that the most significantly enriched gene set was 
‘NFE2L2 targets, which consisted of NRF2 target genes (Fig. 1c, d); 
the expression changes of these target transcripts were validated by 
focused quantitative reverse transcription PCR (qRT-PCR) analysis 
(Extended Data Fig. 1f). CBR-470-1 also induced transcript levels 
of NQO1 and HMOX1, and enhanced NRE2 protein stabilization in 
HEK293T, SH-SY5Y and primary human lung fibroblasts (Extended 
Data Fig. 1h, i), indicating that these effects are not restricted to a spe- 
cific cell type. Genetic depletion of NRF2 protein using short hairpin 
RNA (shRNA) inhibited the ability of CBR-470-1 and TBHQ to induce 
luciferase expression, indicating that CBR-470-1 activity is dependent 
on NRF2 (Extended Data Fig. 1j). Finally, CBR-470-1 treatment 
induced a cytoprotective NRF2 phenotype in vitro, as shown by the 


for 30 min with CBR-470-1 (left) or viral shRNA knockdown of PGK1 
(right) relative to DMSO and scramble shRNA controls, respectively. BPG 
refers to both 2,3-BPG and 1,3-BPG, whereas 1,3-BPG refers specifically 
to the 1,3-isomer. 3-PG; 3-phosphoglycerate; Cit, citrate; FBP, fructose- 
1,6-bisphosphate; G6P, glucose 6-phosphate; GAP, glyceraldehyde-3- 
phosphate; Glc, glucose; Glu, glutamate; GSSG, glutathione disulfide; 

Lac, lactate; NADP*, nicotinamide adenine dinucleotide phosphate; 
NAD‘, nicotinamide adenine dinucleotide; NADH, nicotinamide adenine 
dinucleotide (reduced); NADPH, nicotinamide adenine dinucleotide 
phosphate (reduced); PEP, 2-phosphoenolpyruvate; PPP, pentose 
phosphate pathway; RSP, ribose-5-phosphate; Succ, succinate; TCA, 
tricarboxylic acid. g, ARE-LUC reporter activity in IMR32 cells co-treated 
with CBR-470-1 (541M) and 2-deoxyglucose (2DG) for 24 h (n= 12). 
Statistical analyses are by univariate two-sided t-tests (b, c, g). Data are 
mean and s.d. of biologically independent samples. 


protection of SH-SY5Y cells challenged with the cell-permeable perox- 
ide tert-butyl hydroperoxide (TBHP) (Extended Data Fig. 1k). 

We next sought to determine whether CBR-470-1 or related ana- 
logues induce the activation of NRF2 signalling in vivo. Oral dosing of 
BALB/c mice with 20 or 100 mg kg™' (twice a day, orally) ofa glycine- 
substituted analogue, CBR-470-2, which is equally potent to CBR-470-1 
and has more favourable bioavailability (Extended Data Fig. 2a-e), 
induced NRF2 target gene expression in several organs, with dose- 
dependent increases in Ngo1 and Hmox1 transcript levels observed in 
the skin (Fig. le). Published studies in Nrf2 (also known as Nfe2/2)- 
knockout mice have demonstrated that NRF2 is essential to protect 
against photo-ageing phenotypes and skin carcinogenesis’® resulting 
from ultraviolet (UV) irradiation’’. We therefore evaluated CBR-470-2 
activity in this acute UV damage mouse model. Mice were prophylacti- 
cally dosed with CBR-470-2 (50 mg kg™!, twice a day, orally) or BARD 
(3 mg kg“, orally) for five days before exposure to a single dose of UV 
irradiation. Mice were then dosed for an additional five days, eutha- 
nized and UV damage was quantified. CBR-470-2 and BARD treatment 
resulted in comparable beneficial effects on erythema histological scores 
and total wounded area (Fig. 1f, Extended Data Fig. 2f, g). Both CBR- 
470-2 and BARD were also found to decrease epidermal thickness in 
response to UV exposure, consistent with activation of the NRF2 cyto- 
protective program'® (Fig. 1g, h). The combined pharmacodynamic and 
efficacy data indicate that CBR-470-2 treatment is capable of modulat- 
ing NRF2 signalling in vivo, despite this compound series operating via 
an apparent mechanism that is independent of direct KEAP1 binding. 
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Fig. 3 | Methylglyoxal modifies KEAP1 to form a covalent, high- 
molecular mass dimer and activate NRF2 signalling. a, Time-course, 
anti-Flag western blot analysis of whole-cell lysates from HEK293T cells 
expressing Flag~KEAP1 treated with DMSO or CBR-470-1. b, Western 
blot monitoring of Flag-KEAP1 migration in HEK293T lysates after 
incubation with central glycolytic metabolites in vitro (1 and 5 mM, left 
and right for each metabolite). c, Flag-KEAP1 (red) and 6-actin (green) 
from HEK293T cells treated with MGO (5 mM) for 8 h. d, Relative NQO1 
and HMOX1 mRNA levels in IMR32 cells treated with MGO (1 mM) or 
water control (n = 3). e, LC-MS/MS quantitation of cellular MGO levels 

in IMR32 cells treated with CBR-470-1 relative to DMSO (n= 4). f, ARE- 
LUC reporter activity in HEK293T cells with transient shRNA knockdown 
of GLO1 (n=8). Statistical analyses are by univariate two-sided t-test (d, f). 
Data are mean + s.e.m. of biologically independent samples. 


To determine the mechanism by which CBR-470-1 activates NRF2 
signalling, we generated a photo-affinity probe that contained biotin 
and diazirine substituents, termed CBR-470-PAP, which retained cellu- 
lar activity in ARE-LUC reporter assays (EC59 = 2.4|1M; Extended Data 
Fig. 3a, b). Treatment of IMR32 cells with 5 4M CBR-470-PAP for 1h, 
followed by UV irradiation and subsequent anti-biotin western blot anal- 
ysis of cellular lysates (Fig. 2a), together with biochemical fractionation 
and liquid chromatography-tandem mass spectrometry (LC-MS/MS) 
analysis, identified the enzyme phosphoglycerate kinase 1 (PGK1) as a 
potential target of CBR-470-PAP (Extended Data Fig. 3c). In vitro bind- 
ing experiments with recombinant protein revealed that CBR-470-PAP 
selectively labelled PGK1, which was blocked with the soluble compet- 
itor CBR-470-1, or shRNA depletion of PGK1 protein levels (Extended 
Data Fig. 3d-f). Thermal stability assays in the presence of CBR-470-1 
resulted in a consistent shift in PGK1 stability, and isothermal dose 
response profiling’? against PGK1 and GAPDH also confirmed the 
selective, dose-dependent alteration of PGK1 stability in the presence 
of CBR-470-1 (Extended Data Fig. 3g-i). Furthermore, transient and 
viral shRNA knockdown of PGK1 in IMR32 cells activated the ARE- 
LUC reporter signal and expression of NQO1 (Fig. 2b, c). Knockdown 
or overexpression of PGK1 protein modulated the NRF2 reporter, with 
decreased or increased observed CBR-470-1 ECso values, respectively 
(Fig. 2d, e). Finally, depletion of enolase 1, an enzyme downstream 
of PGK1, was also found to induce the ARE-LUC signal in IMR32 
cells (Extended Data Fig. 3j). These results suggest that CBR-470-1 
modulation of PGK] activity, and therefore glycolysis, regulates NRF2 
activation. 

Consistent with the PGK1 inhibitory activity of CBR-470-1 
(Extended Data Fig. 4a, b), targeted metabolomic profiling*”’ of IMR32 
cells treated with compound revealed a rapid increase in metabolite 
levels upstream of PGK1 (1,3- and 2,3-bisphosphoglycerate (BPG), and 
p-glyceraldehyde-3-phosphate (GAP)), and depletion of downstream 
metabolites such as 3-phosphoglycerate and lactate, which mirrored 
the profile observed upon viral knockdown of PGK1 in IMR32 cells 
(Fig. 2f, Extended Data Fig. 4c, d, Extended Data Table 2). In addition, 
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the co-treatment of ARE-LUC-expressing reporter cells with CBR- 
470-1 and an inhibitor of glucose entry into glycolysis, 2-deoxyglucose, 
significantly reduced reporter activation in a dose-dependent manner 
(Fig. 2g). Together, these data suggested that glycolytic intermediates 
may serve as a signal to the NRF2 signalling axis. 

We first investigated whether 1,3-BPG, which is directly metabo- 
lized by PGK1, could be involved in signalling to the KEAP1-NRF2 
pathway via phosphoglyceryl-lysine modification of KEAP1. However, 
CBR-470-1 treatment of IMR32 cells for 30 min, a time at which 1,3- 
BPG levels are increased, did not result in altered KEAP1 levels, or 
any anti-phosphoglyceryl-lysine immunoreactive bands using poly- 
clonal antibodies raised against the phosphoglyceryl-lysine epitope 
(Extended Data Fig. 5a—-c). These western blots did, however, reveal 
the appearance of a CBR-470-1 dose-dependent, high-molecular mass 
KEAP1 (HMM-KEAP1) band at roughly twice the molecular mass of 
monomeric KEAP1 (Fig. 3a). The HMM-KEAP1 band was stable to 
reduction (Extended Data Fig. 5d) and exhibited kinetics and dose- 
dependent formation consistent with CBR-470-1-dependent NRF2 
stabilization and NQO1 induction (Fig. 1b), but distinct from the direct 
KEAPI alkylator TBHQ (Extended Data Fig. 5e—j). Co-treatment 
of cells with CBR-470-1 and either reduced glutathione (GSH) or 
N-acetylcysteine inhibited HMM-KEAP1 band formation (Extended 
Data Fig. 5k). Knockdown of PGK1, which activates NRF2 target gene 
expression, also formed HMM-KEAP1, and this band was competed 
by co-treatment with GSH (Extended Data Fig. 51). Together, these data 
indicated that modulation of glycolysis by CBR-470-1 results in the 
formation of HMM-KEAP1 that is consistent with a covalent KEAP1 
dimer, which has been previously observed*!-?3, but remained unchar- 
acterized at the molecular level. 

Several central glycolytic metabolites other than 1,3-BPG contain 
reactive functionalities, including the triosephosphate isomers GAP and 
dihydroxyacetone phosphate, as well as their non-enzymatic elimination 
product methylglyoxal (MGO), an electrophilic dicarbonyl compound 
that has been found to form numerous modifications on nucleophilic 
residues in proteins”*”°. Among these candidates, only treatment of 
cell lysates or live cells with MGO resulted in the selective formation 
of HMM-KEAPI (Fig. 3b, c). Treatment of Flag-KEAP1-containing 
lysates or purified KEAP1 with freshly distilled MGO induced 
dose-dependent formation of HMM-KEAP1 at mid-micromolar 
concentrations (Extended Data Fig. 5m, n), which is consistent with the 
range of MGO concentrations previously reported in living cells*®”. 
MGO treatment in cells functionally activated expression of the down- 
stream NRF2 target genes NQOI and HMOX]1 (Fig. 3d). Targeted 
LC-MS measurement of derivatized MGO confirmed that CBR-470-1 
treatment resulted in a significant increase in cellular MGO levels in 
the first few hours of treatment (Fig. 3e, Extended Data Fig. 6a—c), 
which, along with pathway activation, was sensitive to GSH treatment 
(Extended Data Fig. 6d-f). To test the involvement of MGO in KEAP1- 
NRF? signalling further, we perturbed its degradation, which is medi- 
ated by GSH and glyoxylase 1 (GLO1). Knockdown of GLO1 by shRNA 
resulted in ARE-LUC reporter activation (Fig. 3f), and also sensitized 
cells to CBR-470-1 activation of the ARE-LUC reporter (Extended 
Data Fig. 6g). Direct modulation of GLO1 enzymatic activity with a 
cell-permeable inhibitor (GLOi) also amplified reporter activation by 
CBR-470-1 (Extended Data Fig. 6h). Collectively, these metabolomic, 
proteomic and transcriptomic data established shared kinetics between 
MGO accumulation, HMM-KEAP1 formation and NRF2 pathway acti- 
vation, suggesting the existence of a direct link between glycolysis and 
the KEAP1-NRF2 signalling pathway that is mediated by the modifi- 
cation of KEAP1 by MGO. 

A stable isotope labelling with amino acids in cell culture (SILAC)- 
based quantitative proteomic approach (Extended Data Fig. 7a) sug- 
gested the N-terminal region (amino acids 1-50) and BTB domains 
(amino acids 150-169) as candidate domains and residues that could 
be involved in HMM-KEAP1 formation in response to the CBR-470- 
1-induced increase in MGO (Extended Data Fig. 7b-d). We therefore 
examined more than a dozen C-to-S, K-to-M/R and R-to-A mutations 
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Fig. 4 | Methylglyoxal forms a posttranslational modification between 
proximal cysteine and arginine residues in KEAP1. a, Quantified 
HMM-KEAPI formation of wild-type (WT) or mutant Flag-KEAP1 from 
HEK293T cells treated with DMSO or CBR-470-1 for 8 h (n= 23 for WT; 
n= 16 for R15A; n= 13 for C151S; n=7 for K39R, R135A; n=4 for R6A, 
RSOA, all other C-to-S mutations, and R15/135A and C151S triple-mutant; 
n=3 for R15/135A, and all K-to-M mutations). b, Schematic of the model 
peptide screen for intramolecular modifications formed by MGO and 
nucleophilic residues. c, Total ion and extracted ion chromatograms (TICs 
and EICs, respectively) from MGO- and mock-treated peptide, with a new 
peak in the former condition marked by an asterisk. EICs are specific to 
the indicated m/z (n =3 independent biological replicates). d, 'H-NMR 
spectra of the unmodified (top) and MICA-modified (bottom) model 
peptide, with pertinent protons highlighted in each. Notable changes 

in the MICA-modified spectrum include the appearance of a singlet at 


within these domains, as well as other known functional residues in 
KEAPI1, for their effect on HMM-KEAP1 formation. Two arginine 
residues (R15 of the N-terminal region domain and R135 of the BTB 
domain) significantly, but incompletely, reduced the formation of 
HMM-KEAPI (Fig. 4a). More notable was the near complete inhibition 
of HMM-KEAP1 formation of the C151S mutant in the BTB domain 
(Fig. 4a). Consistent with this effect, levels of C151-containing tryptic 
peptides were reduced by MGO treatment, and the pre-treatment of 
cells with BARD, which alkylates C151, inhibited HMM-KEAP1 for- 
mation (Extended Data Fig. 7d, e). C151 lies in an exposed region of 
the BTB domain that is predicted to mediate the homodimeric inter- 
face between two KEAP1 monomers, which is necessary for proper 
NRF2 binding and ubiquitination®”*. Therefore, the strong abrogation 
of HMM-KEAP1 formation by the mutation of C151 and proximal 
arginine residues suggested that MGO may be mediating an unchar- 
acterized modification between these residues. 

To identify this modification, we synthesized a model peptide con- 
taining cysteine and arginine separated by a glycine linker, which was 
intended to mimic high inter- or intramolecular cysteine and arginine 
proximity, and treated it with MGO at physiological temperature and 
pH overnight (Fig. 4b). LC-MS analysis revealed a new peak, which 
corresponded to a mass increase of 36 Da, consistent with a mercapto- 
methylimidazole crosslink (Fig. 4c, d) formed by nucleophilic attack 
of the dicarbonyl by the side chains of cysteine and arginine, followed 
by dehydration-mediated formation of a methylimidazole crosslink 
between cysteine and arginine (MICA) posttranslational modification. 
We purified this product and confirmed its structure by a series of 
one- and two-dimensional NMR experiments (Fig. 4d, Extended Data 
Fig. 7f-i). To determine whether MICA modification occurs within 
KEAP1 protein, we treated cells with CBR-470-1 or MGO, isolated 
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2.04 p.p.m., loss of the thiol proton at 2.43 p.p.m., and changes in chemical 
shift and splitting pattern of the cysteine beta protons and the arginine 
delta and epsilon protons. Full spectra and additional multidimensional 
NMR spectra can be found in Extended Data Fig. 7. e, EIC from LC-MS/MS 
analyses of gel-isolated and digested HMM-KEAP1 (CBR-470-1 and 
MGO-induced) and monomeric KEAP1 for the C151-R135 crosslinked 
peptide. Slight retention time variation was observed on commercial 
columns (n = 3 independent biological replicates). f, PRM chromatograms 
for the parent and six parent-to-daughter transitions in representative 
targeted proteomic runs from HMM-KEAP1 and monomeric digests 
(n=6). g, Schematic depicting the direct communication between glucose 
metabolism and KEAP1-NRF2 signalling mediated by MGO modification 
of KEAP1 and subsequent activation of the NRF2 transcriptional program. 
Statistical analyses are by univariate two-sided t-test (a). Data are 

mean + s.e.m. of biologically independent samples. 


HMM-KEAP1 and monomeric KEAP1 by gel electrophoresis, and then 
digested these discrete populations for LC-MS/MS analyses. A peptide 
bearing a MICA crosslink between C151 and R135 was identified in 
isolated HMM-KEAP1 after both CBR-470-1 and MGO treatment, 
but not in the isolated monomeric KEAP1 (Fig. 4e). Furthermore, 
parallel-reaction monitoring (PRM) confirmed the presence and co- 
elution of more than a dozen parent-to-daughter ion transitions that were 
uniquely present in HMM-KEAPI (Fig. 4f, Extended Data Fig. 8a, b). 
These studies suggest a model in which glycolytic metabolic status is 
coupled to NRF2-dependent gene expression through the direct inter- 
action of a reactive glycolytic metabolite, MGO, and the sentinel protein 
KEAP!1 via the formation of a stable and mechanistically novel protein 
posttranslational modification (Fig. 4g). 

Although MGO has been previously shown to form covalent 
modifications on diverse proteins, the compositions, sites and functions 
of these modifications have remained largely unknown. Similarly, 
several recent reports have implicated MGO in the pathogenesis of 
diseases such as diabetes”® and ageing”’, but the discrete molecular 
targets of MGO in these contexts are unidentified. Here we found that 
inhibition of PGK1 increases levels of triosephosphates, which results in 
increased levels of cellular MGO and the formation of a HMM-KEAP1 
species that leads to NRF2-dependent gene expression. Formation of 
the HMM-KEAPI species involves the posttranslational modification 
MICA that is dependent on MGO and forms a covalent linkage between 
proximal cysteine and arginine residues. These results raise intriguing 
questions about the general reactivity of MGO, its potential role as a 
signalling metabolite in other cellular processes, and the specific mod- 
ifications involved in often-cited advanced glycated end products as 
biomarkers of disease pathology. We have shown that both cellular 
and lysate treatment with MGO results in the selective modification of 
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C151 in KEAP1, probably owing to the intrinsic hyperreactivity of this 
residue, and the presence of properly oriented arginine(s) that enables 
formation of the MICA modification. Additional factors such as local 
metabolite concentration gradients may also contribute to MICA for- 
mation in KEAP1. Future studies to determine the full target profile of 
MGO, and specifically other inter- and intramolecular MICA modifi- 
cations, are expected to shed light on this model and provide a global 
view of MGO modification sites in the proteome. 

The direct connection between glucose metabolism and the KEAP1- 
NRF2 axis by MGO adds a further layer of regulation to both pathways 
and global metabolic status. First, this connection highlights the role of 
glycolysis in regulating cellular redox status beyond the contribution to 
NADPH and glutathione production. These reducing equivalents are 
crucial for the regulation of a wide range of reactive species in the cell, 
and when these levels are deregulated, the KEAP1-NRF2 pathway is 
poised to respond and limit cellular damage. Recent studies have also 
implicated the output of the NRF2 transcriptional program in the direct 
detoxification of MGO through increased glutathione synthesis*”, 
GLO1 transcription, as well as the redirection of glucose carbon away 
from central metabolites (for example, MGO) and into the pentose 
phosphate pathway". Therefore, the direct coupling of glucose metabo- 
lism with KEAP1 function through MGO creates an intrinsic feedback 
loop to sense and respond to changing metabolic demands in the cell. A 
final aspect of this study is the notion that modulation of endogenous 
reactive metabolite levels using small molecules may represent an alter- 
native approach towards activating the ARE pathway for the treatment 
of several diseases that involve metabolic stress. 
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METHODS 

No statistical methods were used to predetermine sample size. Chemicals. 
TBHQ, 2-deoxyglucose, MGO and GSH were obtained from Sigma Aldrich. 
The synthesis of AI-1 has been described previously”. The GLO1 inhibitor 
(CAS no. 174568-92-4) was from MedChemExpress. CBR-470-0 and CBR-581-9 
were from ChemDiv. CBR-470-1 (initially from ChemDiv as D470-2172) and 
related analogues were synthesized in house according to full methods described 
in the Supplementary Information. All commercially obtained chemicals were 
dissolved in DMSO and used without further purification, with the exception 
of 2-deoxyglucose, MGO and GSH, which were delivered as aqueous solutions. 
Cell culture. IMR32, SH-SY5Y, HeLa and HEK293T cell lines were purchased 
from ATCC. Human lung fibroblasts and mouse dermal fibroblasts (C57BL/6- 
derived) were obtained from Sciencell and used before passage 3. IMR32, human 
lung fibroblasts, SH-SY5Y, HeLa and HEK293T cells were propagated in DMEM 
(Corning) supplemented with 10% fetal bovine serum (FBS, Corning) and 1% 
penicillin/streptomycin (Gibco). Mouse dermal fibroblasts were propagated in 
fibroblast medium 2 from Sciencell. Mouse epidermal keratinocytes (MPEK-BL6) 
were obtained from Zen Bio and propagated in epidermal keratinocyte medium 
(Zen Bio). 

High-throughput screening and ARE-LUC reporter assay. For high-throughput 
screening, IMR32 cells were plated at 5 x 10° cells per well in white 384-well plates 
in 4011 of growth medium. The next day 100 ng of pTI-ARE-LUC reporter plasmid 
in 10,1 of Optimem medium (Gibco) was transfected into each well using Fugene 
HD at a dilution of 1 1g plasmid DNA:4 11 of Fugene. After 24 h, compounds were 
transferred using a 100 nl pintool head affixed to PerkinElmer FX instrument such 
that the final screening concentration was 21M. After 24 h incubation, ARE-LUC 
luminance values were recorded on an Envision instrument after the addition of 
30,11 of Bright Glo reagent solution (Promega, diluted 1:3 in water). Compounds 
which increased ARE-LUC signal greater than four Z-scores from plate mean were 
deemed hits. For overexpression and knockdown experiments in HEK293T with 
ARE-LUC reporter readouts, 5 x 10° cells were plated on poly-p-lysine-coated 
plates and transfected with 1.5 1g of overexpression or shRNA plasmid and 500 ng 
of pTI-ARE-LUC using Optimem medium and Fugene in the same mode as above. 
After 24 h, 103 transfected cells were plated in 50 ul of growth medium in white 
96-well plates. After a 24 h incubation, a further 50,11 of growth medium with com- 
pound at the indicated concentrated was added to each well. ARE-LUC luminance 
values were recorded on an Envision plate reader 24 h later by the addition of 75 1l 
of Bright Glo reagent solution (1:3 in water). 

Peroxide stress model. Approximately 10* SH-SY5Y cells were plated in 100 of 
growth medium in white 96-well plates. After 48 h of compound treatment, 20 1l 
TBHP diluted to the indicated concentrations was added to each well. After an 
8 h incubation, cell viability measurements were recorded on an Envision plate 
reader after the addition of 50,11 of a Cell Titer Glo solution (Promega, diluted 1:6 
in water). Relative viabilities are reported as a fraction relative to the same dose of 
compound treatment without TBHP. 

shRNA knockdown studies. PGK1-targeting shRNA vectors sh10 and sh47 
refer to Sigma Mission shRNA lentiviral clones NM_000291.2-338s1cl and 
NM_000291.2-935slcl respectively. GLO1-targeting shRNA vectors sh29 
and sh30 refer to Sigma Mission shRNA lentiviral clones NM_006708.1-195s1cl and 
NM_006708.1-292s1c1, respectively. The non-targeting scrambled control vector 
refers to SHC002 (Sigma). Lentiviruses were generated in HEK293T cells by 
transient expression of the above vectors with pSPAX2 and pMD2.G packaging 
vectors (Addgene plasmids 11260 and 12259). Viral supernatants were collected 
after 48 h of expression and passed through a 70-\1m syringe filter before exposure 
to target cells. 

qRT-PCR. Cells were collected by trypsinization and subsequent centrifugation 
at 500g. RNA was isolated using RNeasy kits from Qiagen and concentrations 
obtained using a NanoDrop instrument. Then 500 ng-5 1g of RNA was then 
reverse transcribed with oligo dT DNA primers using a SuperScript III First-Strand 
Synthesis kit from Invitrogen. Quantitative RT-PCR reactions were measured on 
a Viia 7 Real-Time PCR system (Thermo) using a Clontech SYBR green-based 
master mix. Gene-specific primer sets are provided in Extended Data Table 1. 
Reactions were normalized to TUBG1 levels for each biological replicate and rela- 
tive transcript abundance calculated using the comparative C, method. 

GSEA. Total RNA was extracted from IMR32 cells treated for 24 h with either DMSO 
or 51M CBR-470-1 (3 biological replicates per condition) using an RNeasy kit 
(Qiagen). RNA sequencing (RNA-seq) experiments were performed by the Scripps 
Next Generation Sequencing Core according to established in house methods. 
GSEAs and leading edge heat maps were generated with TPM values from the 
above experiment using the java GSEA package. ‘NFE2L2 targets’ gene set refers 
to Molecular Signature Database (http://software.broadinstitute.org/gsea/msigdb) 
gene set ID M2662. 

Quantitative metabolomic profiling. For polar metabolite profiling experiments, 
cells were grown in 15 cm plates and cultured in RPMI supplemented with 10% 
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FBS, 2 mM t-glutamine and 1% penicillin/streptomycin before media replacement 
containing either vehicle (DMSO) or the indicated dose of CBR-470-1. After incu- 
bation for the appropriate time, cells were scraped into ice-cold PBS and isolated 
by centrifugation at 1,400g at 4°C. Cell pellets were resuspended in 30011 of an 
80:20 mixture of cold methanol/water, an internal standard was added (10 nmol 
d3-serine; Sigma Aldrich), and the suspension was sonicated (Fisher Scientific, 
FB-505) for 5 s followed by a 10 min centrifugation at 16,000g. The supernatant was 
collected, dried under N> gas and resulting dried metabolites resuspended in 30 il 
of 40% methanol/water for analysis on an Agilent triple quadrupole LC-MS/MS 
(Agilent Technologies, 6460 QQQ). For negative mode operation, metabolites 
were separated by hydrophilic interaction chromatography with a Luna-NH; col- 
umn (Phenomenex) running mobile phase A (CH3CN supplemented with 0.2% 
NH,OH) and B (95:5 (v/v) H2O:CH3CN supplemented with 50 mM ammonium 
acetate and 0.2% NH,4OH) and the following gradient: 0% B for 3 min; linear 
increase to 100% B for 27 min at a flow rate of 0.4 ml min, followed by an isocratic 
flow of 100% B for 3 min. The spectrometer settings were: capillary voltage = 4.0 kV, 
drying gas temperature = 350°C at 101 min!, and the nebulizer pressure was 
45 p.s.i. Metabolite peak transitions and retention times are listed in Extended Data Table 2 
and were confirmed by running standards for measured glycolytic, pentose phosphate 
pathway, tricarboxylic acid cycle and co-factor metabolites. 2-phosphoglycerate 
and 3-phosphoglycerate isomers were quantified in aggregate. Relative metabolite 
abundance was quantified by integrated peak area for the given multiple reac- 
tion monitoring transition, and all metabolite levels were normalized to inter- 
nal standard extracted ion intensity values for d3-serine. These parameters were 
used to quantify all metabolites, with the exception of 1,3-BPG and MGO, which 
required chemical derivitization to stable intermediates before LC-MS/MS quan- 
tification, as previously reported*°**. MGO deviated from all other metabolites, 
as it was separated on a Gemini reverse-phase C18 column (5 mm, 4.6 mm x 
50 mm; Phenomenex) together with a pre-column (C18, 3.5 mm, 2 mm x 20 mm) 
and detected in positive mode analysis, with mobile phase A (H2O) and B (50:50 
(v/v) H20:CH3CN) supplemented with 0.1% trifluoroacetic acid. The gradient 
started with 0% B for 2 min and increased linearly to 100% B over 10 min with a 
flow rate of 0.4 ml min“, followed by an isocratic gradient of 100% B for 5 min 
at 0.4 ml min™!, The QQQ settings were the same as above. 

Flag-tagged protein expression and western blotting. Full-length, human PGK1 
(NM_000291, Origene) transiently expressed from a pCMV6 entry vector with 
a C-terminal Myc-DDK tag; full-length, human KEAP1 (28023, Addgene) was 
transiently expressed from a pcDNA/FRT/TO plasmid with a C-terminal 3 x Flag 
tag. All references to Flag~PGK1 or Flag~KEAP!1 represent the proteins in the 
aforementioned vectors, respectively. Transient protein expression was performed 
in confluent 10 cm plates of HEK293T cells by transfection of 1g plasmid with 
Lipofectamine 2000 (Invitrogen) according to manufacturer’s protocol. For in 
situ compound or metabolite treatment experiments, compounds were added 
approximately 24 h after transfection, and incubated for the indicated duration. For 
Flag-KEAP1 western blotting and immunoprecipitation experiments, cells were 
collected by scraping, pelleted by centrifugation, washed twice with PBS and lysed 
in 8 M urea, 50 mM NH4HCO3, phosphatase inhibitor cocktail (Sigma Aldrich), 
and EDTA-free complete protease inhibitor (Roche), pH 8.0, at 4°C. Lysate was 
sonicated (Fisher Scientific, FB-505), insoluble debris cleared by centrifugation, 
and the supernatant was diluted into 4x Laemmli buffer containing 50 mM 
dithiothreitol (DTT) as a reducing agent. Samples were prepared for SDS-PAGE 
by heating to 95°C for 5 min, cooled to room temperature, resolved on NuPAGE 
Novex 4-12% Bis-Tris Protein Gels (Invitrogen), and transferred onto nitro- 
cellulose membranes by standard western blotting methods. Membranes were 
blocked in 2% BSA in TBS containing 0.1% tween-20 (TBST) and probed with 
primary and secondary antibodies. Primary antibodies used in this study include: 
anti-Flag-M2 (1:1,000, F1804, Sigma Aldrich), anti-KEAP1 (1:500, SC-15246, 
Santa Cruz), anti- HSPA1A (1:1,000; 4872, Cell Signaling), anti- ACTB (1:1,000, 
4790, Cell Signaling), anti-GAPDH (1:1,000; 2118S, Cell Signaling) and TUBG 
(1:1,000; 5886, Cell Signaling). Rabbit polyclonal anti-pgK antibody was gener- 
ated using pgK-modified KLH and affinity purification as described‘ at a 1:400 
dilution of a 0.33 mg ml“! stock in 10 mM sodium HEPES (pH 7.5), 150 mM 
NaCl, 30% glycerol and 0.02% sodium azide. Secondary donkey anti-rabbit, don- 
key anti-goat, and donkey anti-mouse (Licor), were used at 1:10,000 dilution in 
2% BSA-containing TBST and incubated for 1 h before washing and imaging on a 
Licor infrared scanner. Densitometry measurements were performed with Image] 
software. 

Time- and dose-dependent CBR-470-1 treatment studies were performed in 
HEK293T cells 24 h after transient transfection of Flag-KEAP1, or in IMR32 cells 
for endogenous KEAP1. Fresh RPMI media with 10% FBS, 2 mM t-glutamine, 1% 
penicillin/streptomycin and the indicated concentration of CBR470-1 (201M for 
time-dependent experiments) or equivalent DMSO was added to cells in 10 cm 
dishes. After the indicated incubation time, cells were lysed in lysis buffer (50 mM 
Tris, 150 mM NaCl, 1% Triton-X 100, phosphatase inhibitor cocktail (Sigma 
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Aldrich), and EDTA-free complete protease inhibitor (Roche), pH 7.4) and pro- 
cessed for western blot as indicated above. 
Target identification studies with CBR-470-PAP. Confluent IMR32 cells in 10 cm 
dishes were exposed to 541M CBR-470-PAP with either DMSO or a 50-fold molar 
excess of CBR-470-1 (2501M) for 1 h at 37°C. Samples were then UV-crosslinked 
using a Stratalinker 2400 instrument for 10 min. RIPA-extracted lysates were then 
fractionated with ammonium sulfate with 20% increments. These fractions were 
then separated via SDS-PAGE and relevant probe labelling was determined by 
anti-biotin (1:500, ab1227, Abcam) western blotting as above. A parallel gel was 
silver stained using the Pierce silver stain kit. Relevant gel slices from the 80% frac- 
tion were excised and PGK1 identity was determined by LC-MS/MS by the Scripps 
Center for Metabolomics and Mass Spectrometry. Follow-up shRNA knockdown 
studies confirmed PGK] as the target within this fraction. 
Dye-based thermal denaturation assay. Thermal denaturation experiments 
were performed using a Protein Thermal Shift Dye Kit (ThermoFisher, 4461146). 
Reactions contained 21M recombinant PGK] with the indicated dose of aqueously 
delivered CBR-470-1 with 1x supplied thermal shift dye and reaction buffer in 
201] reaction volumes. Fluorescence values were recorded using a Viia7 Real-Time 
PCR instrument according to supplied instructions. 
Recombinant PGK1 assay. PGK1 enzymatic activity in the forward direction 
was measured with a coupled enzymatic assay**. Three PGK1 conditions were 
prepared by dissolving recombinant PGK1 in potassium phosphate buffer (10 mM 
KH2PO,, 10 mM MgSO,, pH 7.0), and transferring the aliquots of PGK1 solution 
to the microtubes being treated with same amount of DMSO and indicated con- 
centrations of CBR-470-1. Final concentration of PGK1 is 20 ng ml“! and DMSO 
is 1% for each sample. Two blank conditions, 0j.M and 100\1M of CBR-470-1 
without PGK1, were also prepared for the control measurements. All PGK1 sam- 
ples and blank samples were pre-incubated for 20 min and then transferred to the 
UV-transparent 96 well plate (Corning). The assay solution (10 mM KH,POg,, 2 mM 
G3P, 0.6 mM NAD*, 200 mM glycine, 0.4 mM ADP, pH 7.0) was activated by 
adding GAPDH with 101g ml“ final concentration, and then the assay solution 
was added to the wells containing PGK1 samples and blank samples. The change 
in absorbance at 340 nm at room temperature was measured every 20 s for 45 min, 
by Tecan Infinite M200 plate reader. Each condition was performed with three 
independent replications. 
Isothermal dose response profiling of PGK1. In vitro thermal profiling assay for 
recombinant proteins was performed by dissolving pure recombinant PGK1 and 
GAPDH into PBS, and dividing equal amount of mixture into 9 aliquots. Each 
aliquot was transferred to 0.2 ml PCR microtubes being treated with different 
amounts of CBR-470-1 added from DMSO stock, and equal amount of DMSO 
for the control. Each microtube contains 50 il of mixture with final concentration 
of 451g ml“! for each protein and DMSO concentration 1% with following final 
concentrations of CBR-470-1; 011M, 0.14.M, 0.31.M, 14M, 344M, 101M, 33M, 
100M and 333M. After a 30 min incubation at 25°C, samples were heated at 
57°C for 3 min followed by cooling at 25°C for 3 min using Thermal Cycler. The 
heated samples were centrifuged at 17,000g for 20 min at 4°C, and the supernatants 
were transferred to new Eppendorf tubes. Control experiments were performed 
with heating at 25°C for 3 min, instead of 57°C. Samples were analysed by SDS- 
PAGE and western blot. 
Metabolite treatments and HMM-KEAP!1 screening. For in vitro screening 
of glycolytic metabolites, HEK293T cells expressing Flag-KEAP1 were lysed by 
snap-freeze-thaw cycles (3 x) in PBS, pH 7.4, containing EDTA-free complete pro- 
tease inhibitor (Roche). Lysates were cleared by centrifugation and the supernatants 
normalized for concentration by Bradford reagent (2 mg ml7'). Concentrated 
stocks of each metabolite were made in PBS, which were added to the lysate samples 
for the final indicated concentrations and incubated at 37°C for 2.5 h with shaking. 
After incubation, samples were denatured with 6 M urea and processed for SDS- 
PAGE and western blotting. Methylglyoxal (40% (v/v) with HO), GAP, dihydroxy- 
acetone phosphate, and 2,3-BPG were all obtained from Sigma Aldrich and used as 
PBS stocks. In situ metabolite treatments were performed in HEK293T cells 24 h 
after transfection of Flag-KEAP1, treated with MGO (1 or 5 mM) in HO (Sigma) 
or equivalent vehicle alone for 8 h. Cells were collected by scraping, washed in PBS 
and centrifuged, and lysed in urea lysis buffer and analysis by SDS-PAGE and west- 
ern blot. Dose-response experiments were performed with high purity MGO was 
prepared by acidic hydrolysis of MG-1,1-dimethylacetal (Sigma Aldrich) followed 
by fractional distillation under reduced pressure and colorimetric calibration of the 
distillates, as previously reported. For in vitro MGO dose-response dimerization 
of KEAP1, HEK293T cells expressing Flag-KEAP 1 were lysed in PBS as indicated 
above, then serial dilutions of high purity MGO in 50 mM sodium phosphate, 
pH 7.4, were added to the equal volume of lysate aliquots with final protein con- 
centration of 1 mg ml~'. Each mixture was incubated at 37°C for 8 h with rotating, 
HMM-KEAP1 formation was analysed by SDS-PAGE and western blot. 

For studies with recombinant KEAP1, Flag-KEAP1 was expressed in HEK293T 
cells from transient transfection of the Flag-KEAP1 plasmid (Addgene plasmid 


28023). Flag-KEAP1 protein was immunopurified after overnight incubation at 
4°C with anti-Flag M2 magnetic beads (Sigma) in RIPA buffer in the presence of 
protease inhibitors, eluted with 3 x Flag peptide (150 ng ml!) in PBS, and desalted 
completely into PBS. 500 ng of purified Flag-KEAP1 protein was then subjected to 
reducing conditions with the addition of either TCEP (0.1 mM) or DTT (1 mM) 
for 10 min at 37°C. MGO was then added to a final concentration of 5 mM and 
incubated for 2 h at 37°C. Reactions were quenched by the addition of 5011 of 4x 
sample buffer and subsequent boiling for 10 min. 121 of this reaction was then 
separated by SDS-PAGE and the presence of HMM-KEAP1 evaluated by anti-Flag 
western blotting as described or by silver staining using the Pierce Silver Stain Kit 
(ThermoFisher Scientific). 

Site-directed mutagenesis of KEAP1. KEAP1 mutants were generated with 
PCR primers in Extended Data Table 1 according to the Phusion site-directed 
mutagenesis kit protocol (F-541, Thermo Scientific) and the QuikChange site- 
directed mutagenesis kit protocol (200523, Agilent). Mutant KEAP1 plasmids were 
verified by sequencing (CMV (forward), wild-type primers in the middle of the 
KEAP1 sequence (forward) and BGH (reverse)), and were transiently expressed in 
HEK293T cells in the same manner as wild-type KEAP1. Screening of CBR-470-1- 
induced HMM-KEAPI formation with mutant constructs was performed just as 
with wild-type KEAP1, after 8 h CBR-470-1 treatment (201M). After treatment, 
cells were collected and prepared for SDS-PAGE and western blotting as indicated 
above. 

SILAC cell culture methods and proteomic sample preparation. SILAC labelling 
was performed by growing cells for at least five passages in lysine- and arginine-free 
SILAC medium (RPMI, Invitrogen) supplemented with 10% dialysed fetal calf 
serum, 2 mM t-glutamine and 1% penicillin/streptomycin. ‘Light’ and ‘heavy’ 
media were supplemented with natural lysine and arginine (0.1 mg ml‘), and 
13C., 15N-labelled lysine and arginine (0.1 mg ml~), respectively. 

General protein digestion for LC-MS/MS analysis was performed by dissolving 
protein (for example, whole lysate or enriched proteins) in digestion buffer (8 
M urea, 50 mM NH4HCO;, pH 8.0), followed by disulfide reduction with DTT 
(10 mM, 40 min, 50°C), alkylation (iodoacetamide, 15 mM, 30 min, room temper- 
ature, protected from light) and quenching (DTT, 5mM, 10 min, room tempera- 
ture). The proteome solution was diluted fourfold with ammonium bicarbonate 
solution (50 mM, pH 8.0), CaCl, added (1 mM) and digested with sequencing 
grade trypsin (~1:100 enzyme/protein ratio; Promega) at 37°C while rotating over- 
night. Peptide digestion reactions were stopped by acidification to pH 2-3 with 1% 
formic acid, and peptides were then desalted on ZipTip C18 tips (100 il, Millipore), 
dried under vacuum, resuspended with LC-MS grade water (Sigma Aldrich), and 
then lyophilized. Lyophilized peptides were dissolved in LC-MS/MS buffer A (H2O 
with 0.1% formic acid, LC-MS grade, Sigma Aldrich) for proteomic analysis. 
Proteomic LC-MS/MS and data analysis. LC-MS/MS experiments were per- 
formed with an Easy-nLC 1000 ultra-high pressure LC system (ThermoFisher) 
using a PepMap RSLC C18 column heated to 40°C (column: 75 ym x 15 cm; 
341m, 100 A) coupled to a Q Exactive HF orbitrap and Easy-Spray nanosource 
(ThermoFisher). Digested peptides (500 ng) in MS/MS buffer A were injected onto 
the column and separated using the following gradient of buffer B (0.1% formic 
acid acetonitrile) at 300 nl min~!: 0-2% buffer B over 10 min, 2-40% buffer B over 
120 min, 40-70% buffer B over 10 min, and 70-100% buffer B over 5 min. MS/MS 
spectra were collected from 0 to 150 min using a data-dependent, top-20 ion setting 
with the following settings: full MS scans were acquired at a resolution of 120,000, 
scan range of 400-1,600 m/z, maximum IT of 50 ms, AGC target of 1 x 10°, and 
data collection in profile mode. MS/MS scans were performed by HCD fragmen- 
tation with a resolution of 15,000, AGC target of 1 x 10°, maximum IT of 30 ms, 
NCE of 26, and data type in centroid mode. Isolation window for precursor ions 
was set to 1.5 m/z with an underfill ratio of 0.5%. Peptides with charge state >5, 
1 and undefined were excluded and dynamic exclusion was set to eight seconds. 
Furthermore, S-lens RF level was set to 60 with a spray voltage value of 2.60 kV 
and ionization chamber temperature of 300°C. 

MS/MS files were generated and searched using the ProLuCID algorithm 
in the Integrated Proteomics Pipeline (IP2) software platform. Human pro- 
teome data were searched using a concatenated target/decoy UniProt database 
(UniProt_Human_reviewed_04-10-2017.fasta). Basic searches were performed 
with the following search parameters: HCD fragmentation method; monoisotopic 
precursor ions; high resolution mode (3 isotopic peaks); precursor mass range 
600-6,000 and initial fragment tolerance at 600 p.p.m.; enzyme cleavage specificity 
at C-terminal lysine and arginine residues with three missed cleavage sites per- 
mitted; static modification of +57.02146 on cysteine (carboxyamidomethylation); 
two total differential modification sites per peptide, including oxidized methionine 
(+15.9949); primary scoring type by XCorr and secondary by Zscore; minimum 
peptide length of six residues with a candidate peptide threshold of 500. A mini- 
mum of one peptide per protein and half-tryptic peptide specificity were required. 
Starting statistics were performed with a Amass cutoff= 15 p.p.m. with modstat, 
and trypstat settings. False discovery rates of peptide (sfp) were set to 1%, peptide 
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modification requirement (-m) was set to 1, and spectra display mode (-t) was 
set to 1. SILAC searchers were performed as above with light and heavy database 
searches of MS1 and MS2 files by including static modification of +-8.014168 for 
lysine and +10.0083 for arginine in a parallel heavy search. SILAC quantification 
was performed using the QuantCompare algorithm, with a mass tolerance of 
10 p.p.m. or less in cases in which co-eluting peptide interferes. In general, all 
quantified peptides have a mass error within 3 p.p.m. 

Quantitative proteomic detection of potential KEAP1 modification sites. 
Quantitative surface mapping with SILAC quantitative proteomics was per- 
formed with heavy- and light-labelled HEK293T cells expressing Flag-KEAP1. 
Cells were incubated with DMSO alone (light cells) or CBR-470-1 (20|.M, heavy 
cells) for 8 h. After incubation cells were scraped, washed with PBS (3 x) and com- 
bined before lysis in urea lysis buffer (8 M urea, 50 mM NH,HCO3, nicotinamide 
(1 mM), phosphatase inhibitor cocktail (Sigma Aldrich), and EDTA-free complete 
protease inhibitor (Roche), pH 8.0) by sonication at 4°C. After sonication insoluble 
debris was cleared by centrifugation (17,000g, 10 min), diluted with Milli-Q water 
to give 1 M urea, and lysate was incubated with anti-Flag M2 resin (100, slurry, 
A2220, Sigma Aldrich) at 4°C overnight while rotating. For SILAC label-swap 
experiments, light HEK293T cells were incubated with CBR-470-1 and heavy cells 
were incubated with DMSO and processed as above. Flag resin was washed with 
PBS (7 x 1 ml), Flag-KEAP!1 protein eluted with glycine-HC] buffer (0.1 M glycine, 
pH 3.5, 2 x 50011), followed by 8 M urea (2 x 100,11). The combined eluent was 
brought up to 8 M urea total concentration and processed for trypsin digestion 
and LC-MS/MS analysis as indicated above. 

The SILAC maps were generated by comparing SILAC ratios for each peptide, 

relative to the median value for all KEAP1 peptides. SILAC ratios were converted 
to Log values and plotted to visualize peptides that are significantly perturbed, for 
example by modification, relative to the rest of the protein. A minimum of three 
SILAC ratios for each peptide was required for inclusion in KEAP1 surface maps, 
which allowed for approximately 85-90% coverage of the KEAP1 protein. Missing 
sequences were caused by the lack or close spacing of tryptic sites, resulting in 
inadequate peptides for MS/MS detection. 
In vitro MGO peptide reactions. ‘CR peptide was synthesized using standard 
solid phase peptide synthesis with FMOC-protected amino acids on MBHA rink 
amide resin. Peptides were cleaved in a solution of 94% trifluoroacetic acid, 2.5% 
triisopropyl silane, 2.5% HO, 1% 3-mercaptoethanol and precipitated with ether. 
Peptide identity was confirmed using an Agilent 1100 series LC-MS. Peptides were 
purified via reverse phase HPLC on an Agilent Zorbax SB-C18 250 mm column 
and dried via lyophilization. For MGO reactions CR peptide (1 mM) was incu- 
bated with 12.5 mM MGO (diluted from 40% solution in water; Sigma Aldrich) or 
equivalent amount of water (mock) in 1x PBS pH 7.4 at 37°C overnight. Reactions 
were diluted 1:25 in 95%/5% H»O/acetonitrile and 0.1% trifluoroacetic acid and 
analysed by LC-MS. 

For NMR experiments, approximately 1.5 mg of the CR or CR-MGO crosslinked 
peptide was purified by reverse phase HPLC, lyophilized and dissolved in 700 1l 
ds-DMSO. The peptides were dried via lyophilization. All NMR experiments were 
performed ona Bruker Avance II+ 500 MHz 11.7 Tesla NMR. Data were processed 
and plotted in Bruker Topspin 3.5. CR peptide NMR experiments were run with a 
spectral width of 8.5 for 2D experiments (in both dimensions) and 15 for 1D proton 
NMR with a pulse width of 13.5 1s and an interscan delay of 3 s. For the proton 
NMR, 256 scans were taken. For the COSY-DQF experiment, 128 and 2,048 com- 
plex points were acquired in the F, and F, dimensions respectively, with 8 scans per 
point. For the TOCSY experiment, a mixing time of 601s was used, and 256 and 
1,024 complex points were acquired with 8 scans per point. All CR-MGO peptide 
NMR experiments were run with a spectral width of 13 (in both dimensions) with 
a pulse width of 11.5 1s and an interscan delay of 2.2 s. For the proton NMR, 256 
scans were taken. For the COSY-DQF experiment, 128 and 2,048 complex points 
were acquired in the F, and F, dimensions, respectively, with 8 scans per point. 
For the TOCSY experiment, a mixing time of 80 1s was used, and 256 and 1,024 
complex points were acquired with 8 scans per point. 

In-gel digestion of KEAP1. Targeted proteomic analyses of KEAP1 protein were 
performed by running anti-Flag enriched HMM-KEAP1 and low-molecular mass 
(LMM)-KEAP1 (from both CBR-470-1 or MGO treatments as above) on SDS- 
PAGE gels, and isolated gel pieces were digested in-gel with sequencing grade 
trypsin (Promega), as previously reported*>. Tryptic peptides from in-gel tryptic 
digestions were dissolved in 100 mM Tris-HCl, pH 8.0, with 2 mM of CaCh, and 
further digested with mass spectrometry-grade chymotrypsin (Thermo Scientific) 
according to manufacturer's protocol. Chymotryptic digestion reactions were 
stopped by acidification, and desalted on Ziptip C18 tips. 

Targeted proteomic analysis of crosslinked KEAP1 peptides. Double-digested 
KEAP1 peptides from isolated HMM-KEAP1 and monomeric KEAP1 were 
analysed by LC-MS/MS on an Easy-nLC 1000 ultra-high pressure LC system 
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coupled to a Q Exactive HF orbitrap and Easy-Spray nanosource as indicated above. 
Candidate peptides were initially searched by manual inspection of chromatograms 
and MS1 spectra for m/z values of peptide candidates from predicted digestion 
sites, crosslink sites and differential presence in HMM- and monomeric KEAP1 
from both CBR-470-1-and MGO-treated samples. Extracted MS1 ions of the can- 
didates were present in HMM-KEAP1 digests but not in LMM-KEAP1 digests. 
MS/MS spectra and PRM experiments were collected on the same instrument 
using the following settings: global and general settings included lock masses of off, 
chromatography peak width of 15 s, polarity of positive, in-source CID of 0.0 eV, 
inclusion list set to ‘on; and an m/z value of the target parent ion with its charge state 
in the inclusion list. MS2 scans were performed by HCD fragmentation with micro- 
scans of 1, resolution of 120,000, AGC target of 5 x 10°, maximum IT of 200 ms, 
loop count of 1, MSX count of 1, isolation window of 2.0 m/z, isolation offset of 
0.0 m/z, NCE of 16, and spectrum data type in profile mode. Furthermore, S-lens 
RF level was set to 60 with a spray voltage value of 2.20 kV and ionization chamber 
temperature of 275°C. Targeted PRM experiments were performed on CBR-470-1-, 
MGO-induced HMM-KEAP1 and monomeric KEAP1 samples. 

UVB skin damage model. Thirty-two 5-week-old BALB/c male mice were rand- 
omized into 4 groups of 8 animals such that each group had similar body weight 
means. Mice were prepared for removal of hair from their entire back two days 
before UVB exposure (day 3) by using an electric shaver and depilatory cream. On 
day 5, mice received exposure to UVB (200 mJ cm~”) produced by a broad band 
UVB lamp (Dermapal UVB Rev 2) powered by a Kernel UV Phototherapy system. 
UVB exposure was confined to a rectangular area of ~8 cm” by a lead shielding 
mask. UVB doses were confirmed by dosimeter measurements (Daavlin X96). 
Sham animals were shaved but received no UVB treatment. Mice were dosed from 
day 0 to study end at day 10 via oral gavage twice daily (CBR-470-2, 50 mg kg! 
twice daily, orally; BARD, 3 mg kg! orally; vehicle, 0.5% methyl cellulose/0.5% 
Tween80). Mice were monitored daily for body weight changes and erythema 
scoring from days 5 to 10. Mice were euthanized at day 10 and specimens collected 
for histological analysis from the wounded area. These studies were performed at 
Biomodels, LLC. Blinded erythema scores were recorded by a blinded, trained 
investigator according to established in house scale. In short, a scale of 0 to 4 was 
generated with a score of 0 referring to normal skin and a score of 4 indicating 
severe ulceration. 

Percentage wounded area measurements. Photographs of animals on day 10 
of the study were taken such that the distances from camera, aperture and expo- 
sure settings were identical. Images were then cropped such that only the shaved, 
wounded area encompassed the imaging field. These images were then processed 
with a custom ImageJ macro which first performed a three colour image decon- 
volution to separate the red content of the image*®. The thresholding function 
within Image] software was then used to separate clear sites of wounding from 
red background present in normal skin. Red content corresponding to wounds 
was then quantified as a fraction of the whole imaging field and reported as the 
percentage wounded area. 

Epidermal thickness measurements. H&E-stained skin sections corresponding 
to the wounded area were generated by Histotox Labs and accessed via pathxl 
software. Twenty-four individual measurements of epidermal thickness from 8 
sections spanning a 400-|1m step distance were recorded per animal by a non- 
blinded, trained investigator. These measurements were then averaged to generate 
a mean epidermal thickness measurement per animal. 

Reporting summary. Information on research design is available the Nature 
Research Reporting Summary. 


Data availability 

RNA-seq primary data are deposited in the Gene Expression Omnibus 
(GEO) under accession number GSE116642. Source data for all mouse exper- 
iments have been provided. Full scans for western blots and gels are available 
in the Supplementary Information. All other data are available on reasonable 
request. 
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Extended Data Fig. 1 | See next page for caption. 


© 2018 Springer Nature Limited. All rights reserved. 


Extended Data Fig. 1 | A high-throughput screen identifies a non- 
covalent NRF2 activator chemical series that activate a robust NRF2 
transcriptional program in multiple cell types. a, Plate-based Z-scores of 
ARE-LUC luminance measurements of all test compounds from a 30,000 
compound screen in IMR32 cells. b, Structure of screening hit CBR-470-0. 
c, Relative ARE-LUC luminance measurements from IMR32 cells treated 
for 24 h with a concentration response of CBR-470-0 and reported NRF2 
activators TBHQ and AI-1 (n =3 biologically independent samples, mean 
and s.e.m.). d, LC-MS quantification of CBR-470-1 (501M) incubated 

in the presence or absence of GSH (1 mM) in PBS for 1 h (left) and 48 h 
(right). Relative ion intensities within each time point were compared 
with representative chromatograms shown (n= 2). e, Relative ARE-LUC 
luminance values from IMR32 cells transfected with wild-type (wt) 

or mutant (mt; two core nucleotides necessary for NRF2 binding were 
changed from GC to AT) ARE-LUC reporter constructs and treated with 
the indicated doses of CBR-470-1 for 24 h (n= 3, mean and s.e.m.). 
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f, Relative abundance of NRF2-dependent transcripts as determined by 
qRT-PCR from IMR32 cells treated for 24 h with 511M CBR-470-1 (n = 3). 
g, Western blot analyses of total NRF2 protein content or NRF2-controlled 
genes (NQO1 and HMOX1) from IMR32 cells treated for 24 h with 5 4M 
CBR-470-1 (n=5). h, Western blot analyses of total NRF2 protein content 
from the indicated cell types treated for 4 h with 5 1M CBR-470-1 (n=3). 
i, Relative expression levels of NQO1 and HMOX!1 from the indicated 

cell types treated for 24 h with 5 1M CBR-470-1 (n=3, mean and s.d.). 

j, Relative ARE-LUC luminescence values from HEK293T cells transfected 
with the indicated shRNA constructs and pTI-ARE-LUC and then treated 
with TBHQ (101M) or CBR-470-1 (511M) for 24h (n= 3). k, Relative 
viability measurements of SH-SY5Y cells treated with either CBR-470-1 
(51M) or TBHQ (101M) for 48 h and then challenged with the indicated 
doses of TBHP for 8 h (n=4). Data are mean and s.d. or s.e.m. of 
biologically independent samples. *P < 0.05, **P < 0.005, ***P < 0.001, 
univariate two-sided t-test. 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | A photoactivatable affinity probe-based 
approach identifies PGK1 as the relevant cellular target of CBR-470-1. 
a, Structure of CBR-470-PAP. b, Relative ARE-LUC luminance values 
from IMR32 cells transfected with pTI-ARE-LUC and then treated with 
the indicated doses of CBR-470-PAP for 24 h (n=3). c, Silver staining 
and anti-biotin western blots of ammonium sulfate fractionated lysates 
from UV-irradiated IMR32 cells treated with 51M for 1 h with or without 
CBR-470-1 competition (250 1M) (n =3). Shown on the right are initial 
proteomic target results from gel-band digestion and LC-MS/MS analysis. 
d, Anti-biotin western blots from in vitro crosslinking assays with 
recombinant PGK1 and EBP1 in the presence of the indicated doses of 
CBR-470-PAP (n= 2). e, Anti-biotin western blot analyses from an in vitro 
crosslinking assay with recombinant PGK] in the presence of CBR-470- 
PAP (141M) and indicated concentration of soluble CBR-470-1 competitor 


(n= 2). f, Anti-biotin western blot analyses of cells treated with 5 1M 
CBR-470-PAP after transduction with anti-PGK1 and anti-EBP1 shRNA 
for 48 h. Depletion of PGK1 protein selectively reduces CBR-470-PAP- 
dependent labelling (n = 2). g, Dye-based thermal denaturation assay with 
recombinant PGK1 in the presence CBR-470-1 (201M) or vehicle alone 
(n=3). Calculated melting temperature (Tw) values are listed. h, i, Dose- 
dependent thermal stability assay of recombinant PGK1 and GAPDH in 
the presence of increasing doses of CBR-470-1 near the T,, of both proteins 
(57°C) (n=5) (h) or room temperature (n= 3) (i). Western blot of sample 
supernatants after centrifugation (13,000 r.p.m.) detected total PGK1 

and GAPDH protein, which were plotted in Prism (below). j, ARE-LUC 
reporter activity in HEK293T cells with transient shRNA knockdown of 
ENO1 (n=3). Data are mean + s.e.m. of biologically independent samples. 
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Extended Data Fig. 4 | CBR-470-1 inhibits PGK1 in vitro and in situ. 

a, Schematic of the GAPDH/PGK1 coupled assay. Pre-equilibration of the 
GAPDH reaction (top left) results in an NAD*/NADH equilibrium, which 
after addition of PGK1 and ADP pulls the reaction to the right, producing 
more NADH. Monitoring NADH absorbance after the addition of PGK1 
(bottom right) can be used to monitor PGK] activity in the forward 
direction (right). Kinetic monitoring of NADH absorbance (340 nm) 

after established equilibrium with GAPDH shows little change (black 
curve), but is significantly increased after the addition of PGK1, pulling 
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the equilibrium to the right (red curve). b, CBR-470-1 does not affect the 
GAPDH equilibrium alone, but significantly inhibits PGK1-dependent 
activity and accumulation of NADH (n=5). ¢, d, Relative level of central 
metabolites in IMR32 cells treated with viral knockdown of PGK1 for 

72 h (c) (n= 4) and with CBR-470-1 relative DMSO alone for the 
indicative times (d) (n = 3). Each metabolite is normalized to the control 
condition at each time point. Statistical analysis was by univariate 
two-sided t-test (b). Data are mean + s.e.m. of biologically independent 
samples. 
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Extended Data Fig. 5 | Modulation of PGK1 induces HMM-KEAP1. 

a, Anti-phosphoglyceryl-lysine (pgK) and anti-GAPDH western blots 
analysis of CBR-470-1 or DMSO-treated IMR32 cells at early (30 min) 

and late (24 h) time points (n =6). b, Anti-Flag (left) and anti-pgK (right) 
western blot analysis of affinity purified Flag-KEAP1 from HEK293T cells 
treated with DMSO or CBR-470-1 for 30 min. Duplicate samples were run 
under non-reducing (left) and reducing (DTT, right) conditions (n=6). 

c, Densitometry quantification of total endogenous KEAP1 levels 
(combined bands at approximately 70 and 140 kDa) in IMR32 cells treated 
with DMSO or CBR-470-1 for the indicated times (n = 6). d, Western blot 
detection of Flag-KEAP1 in HEK293T cells comparing non-reducing 
reagent to DTT (left), and the stability of CBR-470-1-dependent HMM- 
KEAP1 to the presence of DTT (12.5 mM final concentration, middle) 

or B-mercaptoethanol (SME; 5% (v/v) final concentration, right) 

during sample preparation. e, Time-dependent CBR-470-1 treatment of 
HEK293T cells expressing Flag-KEAP1. Time-dependent assays were 

run with 20}1M CBR-470-1 with western blot analysis at the indicated 
time-points (n = 8). f, g, Western blot detection (f) and quantification (g) 
of endogenous KEAP!1 and (-actin in IMR32 cells treated with DMSO or 
CBR-470-1 for the indicated times (n = 6). Arrows indicate monomeric 


KEAP!1 (70 kDa) and HMM-KEAP1 (140 kDa) bands. h, i, Western blot 
(h) detection and quantification (i) of Flag-KEAP1 in HEK293T cells 
exposed to increasing doses of CBR-470-1 (n= 3). j, Kinetic RT-PCR 
measurement of NQO1 mRNA levels from IMR32 cells treated with TBHQ 
(101M) or CBR-470-1 (101M) for the indicated times (n = 3). 

k, Quantification of HMM-KEAP1 formation after treatment with CBR- 
470-1 or the direct KEAP1 alkylator TBHQ, in the presence or absence 
of GSH or N-acetylcysteine (NAC) (n =3). All measurements were taken 
after 8 h of treatment in Flag-KEAP1-expressing HEK293T cells. 

1, Transient shRNA knockdown of PGK1 induced HMM-KEAP1 
formation, which was blocked by co-treatment of cells with GSH (n = 3). 
m, Anti-Flag western blot analysis of Flag-KEAP1 monomer and the 
HMM-KEAPI fraction, with dose-dependent incubation of distilled 
MGO in lysates from HEK293T cells expressing Flag-KEAP1 (n= 4). 

n, SDS-PAGE gel (silver stain) and anti-Flag western blot analysis 

of purified KEAP1 treated with MGO under the indicated reducing 
conditions for 2 h at 37°C (n= 3). Purified protein reactions were 
quenched in 4x SDS loading buffer containing 8-mercaptoethanol and 
processed for gel analysis as in d. Data are mean +s.e.m. of biologically 
independent samples. 
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Extended Data Fig. 6 | MGO and glyoxylase activity regulates NRF2 
activation. CBR-470-1 causes increased levels of MGO in cells. 

a, Schematic depicting chemical derivatization and trapping of cellular 
MGO for analysis by targeted metabolomics using two unique fragment 
ions. b, c, Daughter ion fragments (b) and resulting MS/MS quantification 
of MGO levels (c) in IMR32 cells treated with CBR-470-1, relative to 
DMSO (n= 4). d, Quantitative LC-MS/MS measurement of cellular MGO 
levels in IMR32 cells treated for 2 h with CBR-470-1 or co-treated for 2 h 
with CBR-470-1 and N-acetylcysteine (2 mM) relative to DMSO (n=4). 
e, Relative ARE-LUC luminance values from IMR32 cells transfected with 
pTI-ARE-LUC and co-treated with the indicated doses of CBR-470-1 
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and GSH (n= 3). f, Relative levels of NQO1 and HMOX1 transcripts 
from IMR32 cells co-treated with CBR-470-1 (101M) and the indicated 
concentrations of GSH for 24 h (n= 3). g, Fractional ARE-LUC values 
from HEK293T cells transiently co-transfected with pTI-ARE-LUC 
and the indicated shRNAs and then treated for 24 h with the indicated 
doses of CBR-470-1 (n= 3). h, ARE-LUC reporter activity in HEK293T 
cells treated with CBR-470-1 alone (black) and with a cell-permeable 
small-molecule GLO] inhibitor (red) (n = 3). Statistical analysis was by 
univariate two-sided t-test (d, h). Data are mean + s.e.m. of biologically 
independent samples. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | Schematic of SILAC-based proteomic 

mapping of KEAP1 modifications in response to CBR-470-1 and 
NMR characterization of CR-MGO peptide. a, SILAC experiments in 
which Flag-tagged KEAP1 was treated with vehicle (‘light’) and CBR- 
470-1 or MGO (‘heavy’). Subsequent mixing of the cell lysates, anti-Flag 
enrichment, tryptic digestion and LC-MS/MS analysis permitted the 
detection of unmodified portions of KEAP1, which retained approximate 
1:1 SILAC ratios relative to the median ratios for all detected KEAP1 
peptides. By contrast, peptides that are modified under one condition 
will no longer match tryptic MS/MS searches, resulting in skewed SILAC 
ratios that ‘drop out’ (bottom). b, SILAC ratios for individual tryptic 
peptides from Flag~-KEAP1-enriched DMSO-treated light cells and 
CBR-470-1-treated heavy cells, relative to the median ratio of all KEAP1 
peptides. Highlighted tryptic peptides were significantly reduced by 
three- to fourfold relative to the KEAP1 median, indicative of structural 
modification (n = 8). ¢, Structural depiction of potentially modified 
stretches of human KEAP1 (red) using published X-ray crystal structure 
of the BTB (PDB code 4CXI) and KELCH (PDB code 1U6D) domains. 
Intervening protein stretches are depicted as unstructured loops in green. 
d, SILAC ratios for individual tryptic peptides from Flag~KEAP1-enriched 
MGO-treated heavy cell lysates and non-treated light cell lysates, relative 
to the median ratio of all KEAP1 peptides. Highlighted tryptic peptides 
were significantly reduced by 2- to 2.5-fold relative to the KEAP1 median, 
indicative of structural modification (n = 12). e, Representative western 
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blotting analysis of Flag-KEAP1 dimerization from HEK293T cells pre- 
treated with BARD followed by CBR-470-1 treatment for 4 h (n=3). 

f, ‘H-NMR of CR-MGO peptide (isolated product of MGO incubated 
with Ac-NH-VVCGGGRGG-C(O)NH) peptide). 'H-NMR (500 MHz, 
ds-DMSO) 6 12.17 (s, 1H), 12.02 (s, 1H), 8.44 (t, J =5.6 Hz, 1H), 8.32-8.29 
(m, 2H), 8.23 (t, J=5.6 Hz, 1H), 8.14 (t, J=5.9 Hz, 1H), 8.05 (t, J=5.9 
Hz, 1H), 8.01 (t, J=5.9 Hz, 1H), 7.93 (d, J=8.5 Hz, 1H), 7.74 (d, J=8.0 
Hz, 1H), 7.26 (s, 1H), 7.09 (s, 1H), 4.33-4.28 (m, 1H), 4.25-4.16 (m, 3H), 
3.83 (dd, J =6.9 Hz, J = 16.2 Hz, 1H), 3.79-3.67 (m, 6H), 3.63 (d, J=5.7 
Hz, 2H), 3.54 (dd, J =4.9 Hz, J = 16.2 Hz, 1H), 3.18-3.13 (m, 2H), 3.04 (dd, 
J =4.9 Hz, J=13.9 Hz, 1H), 2.88 (dd, J=8.6 Hz, J = 13.6 Hz, 1H), 2.04 
(s, 3H), 1.96 (sep, J=6.8 Hz, 2H), 1.87 (s, 3H), 1.80-1.75 (m, 1H), 1.56- 
1.47 (m, 3H), 0.87-0.82 (m, 12H). g, ‘H-NMR of CR peptide (Ac-NH- 
VVCGGGRGG-C(O)NH,). 'H-NMR (500 MHz, ds-DMSO) § 8.27-8.24 
(m, 2H), 8.18 (t, J=5.7 Hz, 1H), 8.13-8.08 (m, 3H), 8.04 (t, J=5.7 Hz, 
1H), 7.91 (d, J=8.8 Hz), 7.86 (d, J=8.8 Hz, 1H), 7.43 (t, ]=5.4 Hz, 1H), 
7.28 (s, 1H), 7.10 (s, 1H), 4.39 (dt, J =5.6 Hz, J=7.4 Hz, 1H), 4.28 (dt, 
J=5.7 Hz, J=7.2 Hz, 1H), 4.21-4.13 (m, 2H), 3.82-3.70 (m, 8H), 3.64 (d, 
J =5.8, 2H), 3.08 (dt, J=6.5 Hz, J=6.5 Hz, 2H), 2.80-2.67 (m, 2H), 2.43 
(t, J=8.6 Hz, 1H), 1.94 (sep, J =6.8 Hz, 2H), 1.85 (s, 3H), 1.75-1.68 (m, 
1H), 1.54-1.42 (m, 3H), 0.85-0.81 (m, 12H) h, 'H-H total correlation 
spectroscopy (TOCSY) of CR-MGO peptide. i, Peak assignment for CR- 
MGO peptide TOCSY spectrum. Data are mean + s.e.m. of biologically 
independent samples. 
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Extended Data Fig. 8 | MS2 analysis of CR-MGO-crosslinked KEAP1 peptide. a, Targeted PRM transitions (n = 6). b, Annotated MS/MS spectrum 


from the crosslinked C151-R135 KEAP1 peptide. 
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Extended Data Table 1 | Primer sequences for qPCR and cloning experiments 


EPHX] 


CTTCACGTGGATGAAGTGGA CTGGCGGAATGAATTTGACT 
ABCC2 GGGATCTCTTCCACACTGGAT _ | CATACAGGCCCTGAAGAGGA 


GLO] TGGATTAGCGTCATTCCAAG GCGGACCCCAGTACCAAG 
PGK1 CTTGGGACAGCAGCCTTAAT CAAGCTGGACGTTAAAGGGA 
TUBG1 _| ATCTGCCTCCCGGTCTATG TACCTGTCGGAACATGGAGG 


Mutation | Primer (Forward Primer (Reverse 

C238 5'-/SPhos/GCA GGG GAC GCG GTG ATG TAC -3’ ee ng eee 
C388 5'-/SPhos/GCC CTC CCA GCA TGG CAA-3! 5'-/SPhos/GTC ACC TCC GCC TTG GAC TCA GT-3' 
C151$ 5'-/SPhos/TGA ACG GTG CTG TCA TGT ACC AGATC-3' | _5'-/SPhos/TGA CGT GGA GGA CAG ACT TCT CGC -3' 


C2738 5'-/SPhos/CCG AAC TTC CTG CAG ATG CAG CT -3! 5'-/SPhos/CGT CAA CGA GTG GGA GCG CAC G-3 
C288s 5'-/SPhos/GTC CGA CTC CCG CTG CAA GGA CT -3' Eee DOC SOG e TRG te Te Tae ae ie 
C3968 5'-/SPhos/GAC CAA TCA GTG GTC GCC CTG -3! 5'-/SPhos/ATG GGG TTG TAA GAG TCC AGG GC -3’ 
C4058 5'-/SPhos/CGT GCC CCGTAA COG CAT CG -3! 5'-/SPhos/CTC ATG GGG GCG CTG GGC G -3' 


S'- CTC CCA GCA TGG CAA CGC CAC CTT CAG CTA | S'- GTGTAG CTG AAG GTG GCG TTG CCA TGC TGG 
RS50A f ' 
CAC -3 GAG -3 
R135A S'- CCC AAG GTC ATG GAG GCC CTC ATT GAATTC | S'- AGG CGA ATT CAATGA GGG CCT CCA TGA CCT 
GCCT -3' TGGG -3' 
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Extended Data Table 2 | Acquisition parameters used for targeted metabolomic measurements on a triple quadrupole mass spectrometer 
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labelled serine standard included in all runs as an internal normalization control. 
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Immune checkpoint blockade therapy has been successful in 
treating some types of cancer but has not shown clinical benefits 
for treating leukaemia!. This result suggests that leukaemia uses 
unique mechanisms to evade this therapy. Certain immune 
inhibitory receptors that are expressed by normal immune cells 
are also present on leukaemia cells. Whether these receptors can 
initiate immune-related primary signalling in tumour cells remains 
unknown. Here we use mouse models and human cells to show that 
LILRB4, an immunoreceptor tyrosine-based inhibition motif- 
containing receptor and a marker of monocytic leukaemia, supports 
tumour cell infiltration into tissues and suppresses T cell activity via 
a signalling pathway that involves APOE, LILRB4, SHP-2, uPAR and 
ARG1 in acute myeloid leukaemia (AML) cells. Deletion of LILRB4 
or the use of antibodies to block LILRB4 signalling impeded AML 
development. Thus, LILRB4 orchestrates tumour invasion pathways 
in monocytic leukaemia cells by creating an immunosuppressive 
microenvironment. LILRB4 represents a compelling target for the 
treatment of monocytic AML. 

To identify novel mechanisms for AML development and immune 
regulation, we analysed the relationship between gene expression of 
known co-stimulating and co-inhibitory receptors and the overall 
survival of patients with AML as documented in The Cancer Genome 
Atlas (TCGA) database. Expression of the mRNA that encodes leuko- 
cyte immunoglobulin-like receptor B4 (LILRB4), an immune inhibitory 
receptor that is restrictively expressed on monocytic cells?-* and mono- 
cytic AML cells (FAB M4 and M5 AML subtypes)”, ranked at the top of 
the list for negative correlation with patient survival (Fig. la, Extended 
Data Fig. la—d, Supplementary Table 1). Notably, LILRB4 levels were 
higher on monocytic AML cells than on normal monocytes (Fig. 1b). 

The extracellular domain of LILRB4 inhibits T cell activity®. To test 
whether LILRB4 expressed on AML cells suppresses T cells, we cul- 
tured LILRB4-positive leukaemia cells, LILRB4-negative leukaemia 
cells, or normal haematopoietic cells together with either autologous 
T cells or T cells from healthy donors. Only LILRB4-positive mono- 
cytic AML cells substantially suppressed T cell proliferation (Fig. 1c, 
Extended Data Fig. le, f). We then deleted LILRB4 from human 
monocytic AML THP-1 and MV4-11 cells and found that the ability 


of AML cells to suppress T cells was reduced upon LILRB4 knockout 
(LILRB4®°) and was restored by forced expression of wild-type LILRB4 
(LILRB4°-WT), but not by forced expression of a mutant LILRB4 with 
deleted intracellular domain (as LILRB4*°-intA) (Fig. 1d, Extended 
Data Fig. 2a—g). Moreover, when wild-type THP-1 cells and human 
T cells were cultured in separate transwells, LILRB4-mediated T cell 
inhibition was also observed and could be reversed by anti-LILRB4 
blocking antibodies (Extended Data Fig. 2h-p). Blocking LILRB4 
resulted in an increase in T cell cytotoxicity and cytokine release 
(Extended Data Fig. 2q, u). These in vitro data suggest that, instead 
of the extracellular domain*, the intracellular signalling of LILRB4 in 
AML cells is required for suppression of T cell activity. 

Next, we used humanized mouse xenograft models and an immu- 
nocompetent mouse model to investigate LILRB4 function in immune 
checkpoint blockade. Subcutaneous implantation of THP-1 cells—but 
not of LILRB4*° THP-1 cells—resulted in the development of AML in 
human T cell-reconstituted mice, and this was blocked by anti-LILRB4 
treatment’ (Extended Data Fig. 3a-i). Doxycycline-induced deletion of 
LILRB4 in an established disseminated leukaemia model in humanized 
mice also impaired leukaemia development and restored T cells (Fig. le, f, 
Extended Data Fig. 3j-1). In addition, we subcutaneously implanted 
human LILRB4-expressing mouse C1498 AML cells (hLILRB4-C1498) 
into C57BL/6 mice to establish a syngeneic immunocompetent mouse 
model. To exclude the anti-tumour effects of Fc effector functions, we 
treated tumour-bearing mice with anti-LILRB4 with the Fc glycosylation 
site N297A mutation®. Blockade of LILRB4 effectively lowered tumour 
burden and prolonged survival; depletion of CD8* T cells eliminated 
the anti-tumour effects of the anti-LILRB4 antibody (Extended Data 
Fig. 3m-r). These results suggest that the tumour-supportive effect of 
LILRB4 depends on inhibition of host T cells. The anti-LILRB4 anti- 
body treatment generated tumour-specific memory T cells (Extended 
Data Fig. 3s). Similar results were obtained in the disseminated hLIL- 
RB4-C1498 syngeneic mouse model (Extended Data Fig. 3x-z). Finally, 
blockade of LILRB4 reduced leukaemia development in xenografts 
derived from primary human monocytic AML cells (Fig. 1g-i, Extended 
Data Fig. 4a) and increased the number of engraftable autologous human 
T cells (Extended Data Fig. 4b). Together, our in vitro and in vivo results 
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Fig. 1 | LILRB4 expressed on leukaemia cells suppresses T cell 
proliferation. a, Surface expression of LILRB4 quantified by flow 
cytometry analysis of samples from 105 patients with AML (n = 1-34 
for each classification (see Methods), mean + s.e.m.). b, Comparison 

of surface expression of LILRB4 on normal monocytes and neoplastic 
monocytes from the same patients (n = 6 independent patients). MFI, 
mean fluorescence intensity. c, Autologous T cells (pT, patient T cells) 
isolated from a patient with monocytic AML (AML#19) or allogeneic 

T cells (nT, normal T cells) isolated from a healthy donor were incubated 
with irradiated LILRB4* or LILRB4~ (B4* or B47) primary leukaemia 
cells from patient AML#19 (n = 3 biologically independent samples, 
mean + s.e.m.; see Source Data). d, T cells (E, effector cells) isolated from 
healthy donors were incubated with indicated wild-type or modified 
irradiated THP-1 cells (T, target cells) in a cell-contact manner (n = 3 


indicate that LILRB4 signalling in monocytic AML cells suppresses 
T cell-mediated anti-tumour immunity. 

One of the characteristic features of monocytic AML is enhanced 
extramedullary infiltration of tumour cells’. In addition to tumour 


biologically independent samples, mean + s.e.m.). WT, wild-type. 

e, f, Engraftment of human T cells and intravenously transplanted 
doxycycline (Dox)-inducible LILRB4®° THP-1 cells (GFP*) into NOD- 
SCID IL2Ry-null (NSG) mice (m = 5 mice). LV, liver; BM, bone marrow; 
%, percentage of leukaemia cells (GFP*) or human T cells (CD3*) in 
total mouse liver or bone marrow cells. g—i, Representative flow plots 
and quantification of per cent of CD45*LILRB4* cells in bone marrow 
from mice xenografted with human primary monocytic AML cells after 
treatment with anti-LILRB4 antibody or control IgG (n = 8 biologically 
independent samples). Experiment repeated for 16 independent patient 
samples with similar results (Extended Data Fig. 4a). See Methods for 
definition of box plot elements in e-i. All P values from two-tailed 
Student's t-test. 


shrinkage (Extended Data Fig. 3m), antibody blockade of LILRB4 
resulted in a decrease in leukaemic infiltration into internal organs, 
including bone marrow, liver and brain (Extended Data Fig. 3t-v). 
Although anti-LILRB4 antibody treatment did not reduce the size 
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Fig. 2 | LILRB4 promotes migration and infiltration of AML cells. 

a, Comparison of the transendothelial migration abilities of wild-type 
and LILRB4*° THP-1 cells (GFP*) in a transwell assay (n = 3 biologically 
independent samples, mean + s.e.m.). b, Comparison of the short-term 
(20 h) infiltration of wild-type or LILRB4*° THP-1 cells in NSG mice 

(n =5 mice). The numbers of leukaemia cells (GFP*) in liver (LV), spleen 
(SP) and bone marrow (BM) were determined by flow cytometry and 
normalized to number in peripheral blood (PB). c-e, Comparison of long- 
term (21 days) infiltration of wild-type or LILRB4*° THP-1 cells in NSG 
mice (n = 5 mice). c, Percentages of THP-1 cells (hCD45*) engrafted in 
indicated organs at day 21 post-transplant. d, e, Overall survival (d) and 
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body weight (e) over time. f-j, Comparison of transendothelial migration 
(f, n = 3 biologically independent samples, mean + s.e.m.), short-term 

(g, 20 h, n = 5 mice) and long-term (h, 16 days, n = 5 mice) infiltration of 
hLILRB4 C1498 or control C1498 cells (GFP*), and overall survival (i) and 
body weight (j) over time. k, Comparison of short-term (20 h) infiltration 
of indicated wild-type or modified THP-1 cells in NSG mice (n = 5 mice). 
1-n, Comparison of short-term (20 h) infiltration of human primary 
monocytic AML cells (AML#21) in NSG mice (n = 4 mice) after treatment 
with anti-LILRB4 antibody or IgG control. See Methods for definition of 
box plot elements in b, ¢, e, g, h, j-n. All P values from two-tailed Student’s 
t-test except for d, i from long-rank test. 


© 2018 Springer Nature Limited. All rights reserved. 


PBS 


LETTER 


a m Human serum P<0.0001 @ f g 80 
== Mouse serum P=0.0006  80)P<0.0001 WT 0A% 
= - < — LILRB4*° Ph aan ~ — — 
P< 0.0001 é 09%) 
60 : 2 £ 1,500 
= P< 0.0001 —— cls 
s 3 1,300 
7 + = WT 89% ne ee 
TZ ae = LILRB4KO 3 1,198 a 
Oo eae 0.1% 8 0 Sere 5 
20. 5 © bo AES 
: a = Wrab 36 407 40° 40" 407 70? “40% “405 
— LILRB4! i 
eS & & gS & 9 aie Titrant (nM) 
YY YY vy : + ; 
Ro 10° “fot 402,103 104 
APC-conjugated antibodies 
r j . GFP hLILRB4:WT SGP: Apoek 
(=) 5 a : Apoe' 
100 NS NS NS. WT “Apoe’© ’ 
= LILRB4KO ‘3. GFP hLILRB4:WT GFF HULRB4 Anos SEER RLILABAApoek® 
so} ,NS fy 8 GH g 5 APOEK! | [WT ‘© GFP hLILRB4:ApoeXO GFP HLILRB4:ApoeS+ aaa 
8 ! f fl A a = APOEKO2 g A 37.6% © GFP:WT APOE OPC aa — 
Ss fe q 8,000 hua ees -& GFP: Apoe! 607P=0.007P=004 9-478 8 P= 0.0005 
* 4 8 a. | | Apoe'Om 80 P=031 i] ea 8 
a P=0.003 i 62.3% 2 93) a-—8 g 
uw 40 6,000 7] 6]; A 62.3% P< 0.0001 "i — 203/80 "Sg 4 3 
) : f s = 60 = 40 f& |, ais lla & 
2p), R= 00) p< 010001 B WT = oon] Bolsil ¢ x4 
f ‘ ae z 5 
sue P< 0.0001 ovens =p toot a. », 59.8% | & 40 Fd a! 8 la tS 3 
oly § i —|8 © 20 B Ts, 2 
P = 0.0002 for Ko fe} in ie 2 
BEd iff dde deeeeqeqqqqqttqqqeqeqqqqqaccaacacecc 2,000 Apoe® 20 6 0.1 6 |ja_ _s 
C3385 B33 BSBOSHSSSK-NO TANT SBSBSSLLSSBBOS ‘aoa | 67.6% al a 
g PeSrmooo ore shoe usa TIIOWHALSTOS 0 aR 0 0 He 
S 10° 10' 102 10% 104 80:1 40:1 20:1 BM LV SP 
z CD3-APC E:T ratio 


Fig. 3 | APOE is an extracellular binding protein of LILRB4. 

a, Percentages of indicated LILRB reporter cells activated (GFP*) in the 
presence of 10% human serum, 10% mouse serum or PBS control. 

b, Percentages of indicated LILRB reporter cells activated by recombinant 
APOE (10 xg ml~ 1) Red dots indicate PBS treatment of each indicated 
reporter cell line. c, Percentages of LILRB4 reporter cells activated by 
10% mouse serum collected from wild-type or Apoe knockout (Apoe®®) 
mice or PBS control. d, Percentages of LILRB4 reporter cells activated 

by 10 pg ml~! APOE, APOE-POPC, APOAI or APOA1-POPC. 

Mean + s.e.m. (a-d). e, Binding of His-tagged APOE to wild-type and 
LILRB4*° THP-1 cells. f, Binding kinetics of human His-tagged APOE3 
to the extracellular domain of LILRB4 (LILRB4-ECD) were measured 
using MST. Top, fluorescence intensity (Flu. int.) plot and regression of 
the binding; bottom, corresponding residuals (Resi.) versus fits plot. 

g, Percentages of LILRB4 reporter cells activated by wild-type and mutant 
APOE proteins. Mut-N, R142A/K143.A/R145A/K146A/R147A/R150A; 
Mut-Cl, deletion of residues 245-299; Mut-C2, deletion of residues 
279-299. h, Percentages of indicated LILRB4 mutant reporter cells 


of subcutaneous C1498 tumours in C57BL/6 mice depleted of CD8+ 
T cells (Extended Data Fig. 3m), treatment with anti-LILRB4 anti- 
body did lead to a decrease in leukaemia cell infiltration into the liver 
(Extended Data Fig. 3w). We hypothesized that, as well as inhibiting 
T cells, LILRB4 promotes leukaemia infiltration. To test this hypothesis, 
we performed transendothelial migration and homing assays and mon- 
itored leukaemia infiltration relative to LILRB4 expression on leukae- 
mia cells. Human AML THP-1 cells depleted of LILRB4 had lower 
transendothelial migration in vitro than cells that expressed LILRB4 
(Fig. 2a). Deletion of LILRB4 reduced homing and engraftment of AML 
cells to haematopoietic organs (Fig. 2b, c), and resulted in prolonged 
survival of xenografted mice (Fig. 2d) and delayed body weight loss 
(Fig. 2e). By contrast, forced expression of human LILRB4 in mouse 
AML C1498 or WEHI-3 cells had the opposite effects (Fig. 2f-j, 
Extended Data Fig. 5a—e). Antibody-mediated LILRB4 blockade in 
LILRB4-expressing AML cells had the same effects as LILRB4 knockout 
(Extended Data Fig. 5f-t). Leukaemia infiltration depended on LILRB4 
expression and its intracellular signalling in leukaemia cells (Fig. 2k) but 
not the Fc effector functions of the antibody (Extended Data Fig. 5u, 
v). Furthermore, LILRB4 blockade reduced the infiltration ability of 
primary monocytic AML cells (Fig. 2l-n, Extended Data Fig. 4c-e). Our 
results are concordant with previous studies showing that the frequency 
of circulating LILRB4* AML blasts is lower than that of LILRB4~ AML 
blasts® and that LILRB4* chronic lymphocytic leukaemia cells are asso- 
ciated with lymphoid tissue involvement!”. The bone marrow, liver, and 
brain—to which LILRB4* AML cells tend to migrate—are known to 
have certain immune privileges'!~'*. Thus, LILRB4-mediated migra- 
tion, which supports enhanced extramedullary infiltration of mono- 
cytic AML cells, may also contribute to immune evasion. 
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activated by APOE proteins. Data on LILRB4 mutants that interfere with 
APOE activation are highlighted in red (g, h, low-to-high outline and 

line at mean in box plots). i, T cells from healthy donors were incubated 
with indicated irradiated wild-type, LILRB4®° or APOES° THP-1 cells. 

T cells were analysed by flow cytometry after 7 days. j-1, C57BL/6 mouse 
spleen cells (E) were incubated with irradiated human LILRB4-expressing 
(GFP hLILRB4) or control (GFP) C1498 cells (T) at indicated E:T ratios. 
Cells were supplemented with 5% serum from wild-type or Apoe®® mice, 
cultured with anti-CD3 and anti-CD28-coated beads for 60 h, and then 
stained with anti-CD3 antibody. j, Representative flow plots from samples 
at E:T of 20:1. k, Percentages of CD3* T cells (mean + s.e.m.). 1, Effects of 
APOE-POPC rescue of ApoeX° serum. m, Expression of human LILRB4 in 
mouse leukaemia C1498 cells increases leukaemia cell infiltration in wild- 
type recipient mice but not in Apoe*® recipient mice. e, j, Experiments 
were repeated independently three times with similar results. See Methods 
for definition of box plot elements in i, 1, m. All P values from two-tailed 
Student's t-test. NS, not significant. a-d, n = 3 biologically independent 
samples; g-i, k, 1, n = 4 biologically independent samples; m, n = 5 mice. 


The blockade of immune inhibitory and migration functions of 
AML cells by anti-LILRB4 antibodies suggests that these functions 
of LILRB4 are regulated by extracellular mechanisms. Integrin-0,83 
is the ligand for gp49B1, the mouse LILRB4 orthologue’. However, 
a variety of integrin-a8 complexes did not activate human LILRB4 
reporter cells (Extended Data Fig. 6a, b). Unexpectedly, human serum 
and mouse serum were capable of activating the LILRB4 reporter 
but not reporters for other LILRBs (Fig. 3a). Through protein liquid 
chromatography fractionation followed by reporter assays and mass 
spectrometry we identified APOE, which specifically activated 
LILRB4 and mouse PirB reporters (Fig. 3b, Extended Data Fig. 6c-j). 
Serum from wild-type mice, but not Apoe-null mice, activated the 
LILRB4 reporter (Fig. 3c). In addition, both liposome-reconstituted 
APOE protein (APOE-POPC) and lipid-free APOE activated LILRB4 
reporter cells (Fig. 3d). Binding of APOE to THP-1 cells was sig- 
nificantly decreased by deletion of LILRB4 (Fig. 3e). We confirmed 
specific binding of recombinant APOE to LILRB4 using micros- 
cale thermophoresis (MST), surface plasmon resonance (SPR) and 
bio-layer interferometry (Octet). The dissociation constant was 
210 nM as determined by MST (Fig. 3f, Extended Data Fig. 6k, 1). 
Mutagenesis studies showed that the N-terminal domain of APOE, 
and P35 and W106 in the first immunoglobulin domain and Y121 in 
the linker region between two immunoglobulin domains of LILRB4, 
are critical for APOE-mediated activation of LILRB4 (Fig. 3g, h, 
Supplementary Table 2, Extended Data Fig. 6m). 

The finding that APOE activates the immune inhibitory recep- 
tor LILRB4 is consistent with the immunosuppressive function of 
APOE!*!°. To determine whether suppression of T cells by LILRB4 
depends on APOE, we examined proliferation of T cells co-cultured 
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Fig. 4 | LILRB4-mediated intracellular signalling controls AML cell 
migration and T cell suppression. a, Expression and phosphorylation of 
three phosphatases in wild-type and LILRB4*° THP-1 cells. b, Primary 

T cells and irradiated indicated THP-1 cells were cultured in the lower 
and upper chambers, respectively. T cells were analysed by flow cytometry 
after 7 days. n = 4 biologically independent samples. c, d, Knockout of 
SHP2 reduces short-term (20 h) and long-term (21 days) infiltration of 
THP-1 cells in NSG mice (n = 5 mice). e, Upstream transcription factor 
analysis of RNA sequencing (RNA-seq) data generated from LILRB4*° 
and wild-type THP-1 cells (n = 2 biologically independent samples). 
Yellow dots highlight the transcription factors involved in the JAK-STAT 
and NFkB pathways. f, Decreased phosphorylation of IKKa and IKKB 

in LILRB4*° THP-1 cells. g, Decreased NFxB in the nuclear fraction in 
LILRB4-KO THP-1 cells. h, i, An NF«B inhibitor (BAY 11-7082) reversed 


with control or APOES° human AML cells. AML cells deficient in 
APOE restored proliferation of T cells and suppressed migration of 
leukaemia cells (Fig. 3i, Extended Data Fig. 6n-t). Moreover, the per- 
centage of T cells in co-culture was lower when C1498 cells ectopically 
expressing LILRB4 were treated with wild-type mouse serum compared 
to those treated with Apoe®° mouse serum (Fig. 3), k). Addition of 
liposome-reconstituted APOE to a co-culture of mouse spleen cells and 
LILRB4-expressing AML cells decreased the T cell percentage (Fig. 31). 
Furthermore, expression of LILRB4 increased infiltration of C1498 
cells to bone marrow and liver in wild-type mice but not in Apoe®° 
recipients (Fig. 3m). These data indicate that APOE activates LILRB4 
on human monocytic AML cells to suppress T cell proliferation and 
support AML cell migration. 

We sought to identify the signalling downstream of LILRB4 that 
is required for T cell suppression and leukaemia infiltration. The 
phosphatases SHP-1 (also known as PTPN6), SHP-2 (also known as 
PTPN11) and SHIP (also known as INPP5D) can be recruited to the 
intracellular domain of LILRB?. The level of phosphorylation of SHP-2, 
but not of SHP-1 or SHIP, was lower in LILRB4*° AML cells than in 
wild-type cells (Fig. 4a, Extended Data Fig. 7a). Loss of SHP-2, but 
not loss of SHP-1 or SHIP, reversed T cell suppression by THP-1 cells 
(Fig. 4b, Extended Data Fig. 7b-d), and decreased short-term (20 h) 
and long-term (21 days) infiltration of THP-1 cells (Fig. 4c, d). Our 
results suggest that SHP-2 is a mediator of LILRB4 signalling. 

Our ingenuity pathway analysis showed that the activity of the key 
transcription factors NFKB1 and RELA in the NFB pathway’, which 
is positively regulated by SHP-2!8, was strongly inhibited by loss of 
LILRB4 (Fig. 4e, Supplementary Tables 3, 4). Consistently, phospho- 
rylation of IKKa/@ and levels of nuclear NF«B were decreased in 
LILRB4®° AML cells (Fig. 4f, g, Extended Data Fig. 7a). Inhibition 
of NF«B signalling reversed T cell suppression and reduced AML cell 
infiltration in a LILRB4-dependent manner (Fig. 4h, i, Extended Data 


608 | NATURE | VOL 562 | 25 OCTOBER 2018 


T cell suppression by THP-1 cells (h) and decreased infiltration of MV4-11 
cells (i) in an LILRB4-dependent manner (n = 4 biologically independent 
samples). j, T cells isolated from healthy donors were supplemented with 
25% conditioned medium (CM) from wild-type or LILRB4*° THP-1 cells. 
Left, representative cells (scale bar, 100 j1m); right, T cells were analysed 
by flow cytometry (n = 4 biologically independent samples). k, 1, T cells 
were incubated with irradiated indicated THP-1 cells supplemented with 
indicated concentrations of recombinant uPAR (k) or ARGI1 (1) proteins 
for 7 days and were analysed by flow cytometry (n = 4 biologically 
independent samples). m, Overexpression of uPAR (PLAUR) or ARG1 
rescued the infiltration defect in LILRB4"° MV4-11 cells (n = 5 mice). 

a, f, g, j, Experiments repeated independently three times with similar 
results. See Methods for definition of box plot elements in b-d, h-m. 

All P values from two-tailed Student's t-test. 


Fig. 7e, f). Therefore the effects of LILRB4 are mediated through the 
NF&«B pathway, which is particularly robust in monocytic AML among 
AML subtypes”. 

Consistent with our result that AML cells inhibit T cell prolifera- 
tion in transwells (Extended Data Fig. 20, p), conditioned medium 
from wild-type THP-1 cells suppressed T cell activity but conditioned 
medium from LILRB4*° cells did not (Fig. 4j). Among proteins that 
were present at higher levels in the conditioned medium of wild-type 
THP-1 cells than in that of LILRB4*° cells (Extended Data Fig. 7g-i), 
the urokinase receptor uPAR is highly expressed by monocytic AML 
cells*°. uPAR, an NEKB target, is known to promote cancer invasion, 
metastasis, survival and angiogenesis”’”*. The addition of recombinant 
uPAR decreased proliferation of T cells co-cultured with LILRB4®° 
THP-1 cells in a dose-dependent manner (Fig. 4k, Extended Data 
Fig. 7j). This activity of uPAR is likely to be mediated by downstream 
effectors in AML cells because uPAR does not effectively decrease 
T cell proliferation directly (Extended Data Fig. 7k). 

The expression of arginase-1 (ARG1), as with that of uPAR, was 
lower in LILRB4*° AML cells than in wild-type cells (Extended Data 
Fig. 7], m). ARG] inhibits T cell proliferation and can be upregulated by 
uPAR-mediated signalling’*4, APOE” and NF«B”*. We hypothesized 
that ARG] is a key downstream effector of LILRB4-NFKB-uPAR sig- 
nalling. ARG can be secreted by AML cells to inhibit T cell activity””. 
Recombinant ARG1 decreased T cell proliferation in co-culture with 
LILRB4*°, APOE®®° and SHP2®° AML or primary AML cells (Fig. 41, 
Extended Data Fig. 7n—p). Moreover, addition and overexpression of 
either uPAR or ARGI rescued the migration ability of LILRB48° AML 
cells in vitro and in vivo, respectively (Extended Data Fig. 7q, Fig. 4m). 
Together, our results indicate that LILRB4-SHP-2-NFKB-uPAR 
ARG1 isa signalling pathway in monocytic AML cells (Extended Data 
Figs. 8, 9) that suppresses immune activity and supports leukaemia 
migration. 
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Because LILRB4 is restrictively expressed on normal monocytic 
cells’, in which LILRB4 signalling may differ from that in leukaemia 
cells (Extended Data Fig. 9), and LILRB4 blockade did not signifi- 
cantly interfere with normal haematopoietic function (Extended Data 
Fig. 10), LILRB4 targeting may have minimal toxicity. Notably, LILRB4 
is also expressed on certain other types of cancer, myeloid-derived sup- 
pressor cells, tolerogenic dendritic cells and tumour-associated mac- 
rophages”>”8-°, Targeting LILRB4 may thus enable a combination of 
immunotherapy and targeted therapy in cancer treatment. 
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METHODS 

Mice. C57 BL/6J and NOD-SCID IL2R+y-null (NSG) mice were purchased from 
and maintained at the animal core facility of University of Texas Southwestern 
Medical Center (UTSW). Apoe®® (Apoe!”™!4"°) mice*! were purchased from the 
Jackson Laboratory. Animal work described in this manuscript has been approved 
and conducted under the oversight of the UT Southwestern Institutional Animal 
Care and Use Committee (IACUC). For each experiment, the same sex- and age- 
matched (4-8 weeks) mice were used and randomly allocated to each group; and 
for tumour size measurement and in vivo lumina imaging experiments, exper- 
imenters were blinded to the treatment conditions of the mice. The minimum 
number of mice in each group was calculated based on results from our prior 
relevant studies®2-*°, For the subcutaneous tumour model, the tumour size was 
calculated as (width x width x length) cm?. The maximal tumour measurement 
permitted by UTSW IACUC is 2 cm in diameter. In none of the experiments were 
these limits exceeded (see Source Data). We complied with all relevant ethical 
regulations and used approved animal study protocols. 

Cell culture. 293T cells were cultured in Dulbecco's modified Eagle’s medium 
(DMEM) supplemented with 10% fetal bovine serum (FBS) at 37°C in 5% CO2 
and normal O2. Human umbilical vein/vascular endothelium cells (HUVECs) 
(ATCC, CRL-1730) were cultured in endothelial cell growth medium plus growth 
factor, cytokines and supplements (EGM-BulletKit, Lonza) at 37°C in 5% CO2 
and normal O. Human monocytic AML cells (THP-1 (ATCC, TIB-202), MV4-11 
(ATCC, CRL-9591), and U937 (ATCC, CRL-1593.2)) and mouse AML cells 
(WEHI-3 (ATCC, TIB-68)) were cultured in Roswell Park Memorial Institute 
(RPMI) 1640 supplemented with 10% FBS at 37°C in 5% CO, and normal Op. 
Mouse AML cells (C1498 (ATCC, TIB-49)) were cultured in DMEM supplemented 
with 10% FBS at 37°C in 5% CO, and normal O). All cell lines were routinely tested 
using a mycoplasma-contamination kit (R&D Systems). 

Primary human leukaemia cells. Primary human AML and B-cell acute lympho- 
blastic leukaemia (B-ALL) samples were obtained from the tissue banks at UTSW 
and University of Texas MD Anderson Cancer Center (MDACC). Informed con- 
sent was obtained under protocols reviewed and approved by the Institutional 
Review Board at UTSW and MDACC (IRB STU 122013-023 by UTSW and 
LAB10-0682 by MDACC). The UTSW cohort included 105 patients with AML 
representative of AML subtypes by the French-American-British (FAB) classi- 
fication, acute myeloblastic leukaemia with minimal maturation (M1, n = 9), 
acute myeloblastic leukaemia with maturation (M2, n = 34), acute promyelocytic 
leukaemia (M3, n = 10), acute myelomonocytic leukaemia (M4, n = 34), acute 
monocytic leukaemia (M5, 1 = 25), acute erythroid leukaemia (M6, n = 2), and 
acute megakaryoblastic leukaemia (M7, n = 1) and patients with undifferentiated 
leukaemia (AUL; n = 1) and transient myeloproliferative disorder (TAM; n = 2) 
(Supplementary Table 1). Samples were frozen in FBS with 10% DMSO and stored 
in liquid nitrogen. Primary leukaemia samples used in patient-derived xenografts 
(PDXs), co-culture, short-term infiltration assay and western blotting were sum- 
marized in Supplementary Table 5. We have complied with all relevant ethical 
regulations with approved study protocols. 

Human normal monocytes and macrophages. Human normal monocytes (CD14 
cells) were isolated by the AutoMACS Pro Separation System (Miltenyi Biotech) 
from the mononuclear cells fraction of normal peripheral blood. In brief, buffy 
coat was purchased from Interstate Blood Bank and the mononuclear cell layer was 
separated by Ficoll Hypaque (17144003, GE Lifesciences) density gradient separa- 
tion. Mononuclear cells were treated with red blood cell lysis buffer to remove red 
blood cells and then incubated with CD14 microbead-conjugated antibody (130- 
050-201, Miltenyi Biotech) for 15 min at 4°C. CD14-positive cells were then isolated 
using the positive selection program according to the manufacturer's protocol. One 
million CD14" cells were plated in macrophage culture medium, Iscove’s modified 
Dulbecco’s medium (IMDM) (12440053, Thermo Fisher) supplemented with 10% 
human AB serum (MT35060CI, Fisher Scientific), 1% NEAA (11-140-050, Fisher), 
2M t-alanine-L-glutamine (SH3003402, Fisher), per each well of a 6-well plate and 
cultured for 7 days. After incubation, most of the cells were adherent to the plastic 
surface and stained positive for CD14 and other markers specific for macrophages. 
TCGA analyses. Data were obtained from the TCGA acute myeloid leukaemia 
database (version: 16 August 2016). The patients were classified into AML sub- 
types (FAB classification) MO (undifferentiated acute myeloblastic leukaemia) 
(n= 16), M1 (n = 42), M2 (n = 39), M3 (n = 16), M4 (n = 35), M5 (n = 18), M6 
(n = 2), M7 (n= 3); two cases were not classified by subtype. The mRNA levels of 
indicated genes were determined by RNA-seq (polyA+ IlluminaHiSeq). RESM- 
normalized counts are reported, and data were analysed and visualized with UCSC 
Xena (https://xena.ucsc.edu/). For analysis of overall survival, 160 patients with 
available survival data were separated into three groups based on whether they had 
high, moderate or low gene expression and then analysed by Xena Kaplan-Meier 
plot (http://xena.ucsc.edu/survival-plots/). 

Flow cytometry. Primary antibodies including anti-human CD45-PE 
(BD Pharmingen, HI30, 1:100), CD45-FITC (BD Pharmingen, HI30, 1:100), 


CD45-APC (BD Pharmingen, H130, 1:100), anti-human CD34-FITC (BD 
Pharmingen, 55582, 1:100), anti-human CD19-PE (eBioscience, HIB19, 1:100), 
anti-human CD20-PE (BD Pharmingen, 555623, 1:100), anti-human CD11b- 
APC (eBioscience, ICRF44, 1:100), anti-human LILRB4-APC (eBioscience, 
ZM4.1, 1:100), anti-human LILRB4-PE (Biolegend, ZM4.1, 1:100), anti-human 
CD14-APC (eBioscience, 61D3, 1:100), anti-human CD33-APC (Biolegend, P67.6, 
1:100), anti-human CD4-APC (eBioscience, RPA-T4, 1:100), anti-human CD3- 
FITC (BioLegend, HIT3a, 1:100), anti-human CD3-Pacific blue (BD Pharmingen, 
SP34-2, 1:100) anti-human CD8-PE (BD Pharmingen, 555367, 1:100), anti-human 
CD28-APC (eBioscience, CD28.2, 1:100), anti-human CD40L-APC (eBioscience, 
24-31, 1:100), anti-human PD1-APC (Biolegend, EH12.2H7, 1:100), anti-human 
TIM3-APC (eBioscience, F38-2E2, 1:100), anti-human TIGIT-APC (eBioscience, 
MBSA43, 1:100), anti-human LAG3-APC (eBioscience, 3DS223H, 1:100), anti- 
human FasL-PE (eBioscience, 24-31, 1:100), anti-uPAR-APC (Biolegend, VIM5, 
1:100), anti-mouse CD3-APC (BioLegend, 17A2, 1:200), anti-mouse CD8a-PE 
(BioLegend, 53-6.7, 1:200), anti-mouse CD45-PE (BD Pharmingen, 30-F11, 1:200), 
anti-mouse CD49b-APC (eBioscience, DX5, 1:200), anti-mouse CD49f-PE (eBi- 
oscience, GoH3, 1:200), anti-mouse CD11b-APC (BioLegend, M1/71, 1:200), 
anti-mouse CD11b-PE (BioLegend, M1/71, 1:200), anti-mouse CD11c-APC 
(eBioscience, N418, 1:200), anti-mouse F4/80-APC (BioLegend, BM8, 1:200), anti- 
His-tag-APC (R&D Systems, AD1.1.10, 1:400), and IgG isotype-control-APC (eBi- 
oscience, P3.6.2.8.1, 1:400) antibodies were used. Cells were run on either Calibur 
for analysis or FACSAria for analysis and sorting. Flow data were analysed by 
Flowjo software. For analysis of human haematopoietic engraftment in NSG mice, 
a previously published protocol was followed*>”, Propidium iodide (PI) staining 
was used to exclude dead cells in analysis and sorting. For intracellular staining, 
we followed the two-step protocol for fixation/methanol from eBioscience. In 
brief, human primary AML cells were stained for surface expression of LILRB4 
(anti-LILRB4-Alexa Fluor 647, Biolegend, ZM4.1, 1:100) and CD33 (anti-human 
CD33-FITC, Biolegend, HIM3-4, 1:100) and fixable cell viability dye eFluor 450 
(Bioscience, Cat#65-0863-14, 1:100) followed by fixation (IC fixation buffer, 
eBioscience, Cat#00-8222) and methanol treatment. After that, cells were stained 
for intracellular antigens by anti-p-SHP-2 (Y580)-PE (Cell Signaling, Cat#13328S, 
1:100), anti-pIKKa/8 (S176/180) (16A6) (Cell Signaling, Cat#2697, 1:100), anti- 
NF«B (S529)-PE (eBioscience, B33B4WP, 1:100), anti-uPAR-PE (Biolegend, VIM5, 
1:100), anti-arginase-1 (D4E3M) (Cell Signaling, Cat#93668, 1:100), rabbit IgG iso- 
type control-PE (Cell Signaling, Cat#5742, 1:100), mouse IgG isotype control-PE 
(eBioscience, m2a-15F8, 1:100) and anti-rabbit IgG-PE (Jackson Immunoresearch 
Laboratory, Cat#111-116-144, 1:400) for flow cytometry analysis. 

Virus construction and infection. For retrovirus packaging, plasmid con- 
structs XZ201-IRES-GFP and XZ201-human LILRB4 (hLILRB4)-IRES-GFP 
were mixed with PCL-ECO (2:1), followed by transfection into 293T cells using 
Lipofectamine 2000 (Invitrogen). For lentivirus packaging, CRISPR-Cas-9-based 
guide RNA (gRNA) constructs and other constructs for gene overexpression, 
including pLentiLox3.7-luciferase-IRES-GFP, ZsGreen-hLILRB4 and ZsGreen- 
hLILRB4-intA, pLVX-PLAUR-IRES-tdTomato and pLVX-ARG1-IRES-tdTomato 
were mixed with psPAX2 and pMD2.G (Addgene) at a ratio of 4:3:1 and transfected 
into 293T cells using Lipofectamine 2000 (Invitrogen). Virus-containing superna- 
tant was collected 48-72 h post-transfection and used for infection as previously 
described**. 

CRISPR-Cas9-based gene knockout in AML cells. Human AML cells were 
infected with doxycycline-inducible Cas9-expressing lentivirus (pCW-Cas9, 
Addgene 50661). After 1 j1g/ml puromycin selection, surviving cells were infected 
with sgRNA-expressing lentivirus, produced by the plasmid modified from 
pSLQ1651 (Addgene 51024) by replacing the puro-mcherry with GFP for sort- 
ing. Scramble control sgRNA (sgRNA 5/-GAACGACTAGTTAGGCGTGTA-3’), 
LILRB4 targeting sgRNA (sgRNA1 5'-TGTTACTATCGCAGCCCTGT-3’; sgRNA2 
5!-GTAGGTCCCCCCGTGCACTG-3’; sgRNA3 5’-CCTGTGACCTCAGTGCAC 
GG-3’), APOE targeting sgRNA (sgRNA1 5’-CTT TTGGGATTACCTGCGC-3’; 
sgRNA2 5'‘-AACTGGCACTGGGTCGCTTT-3’), SHP1 targeting sgRNA (sgRNA1 
5/-TAAGACCTACATCGCCAGCC-3’; sgRNA2 5‘-GAAGAACTTGCACCAGCG 
TC-3’), SHP2 targeting sgRNA (sgRNA1 5’-GAGACTTCACACTTTCCGTT-3’; 
sgRNA2 5'/-TACAGTACTACAACTCAAGC-3’), SHIP targeting ssRNA 
(sgRNA1 5’-CACGCAGAGCGCGTATGCCC-3’; sgRNA2 5’-TGGCAA 
CATCACCCGCTCCA-3’) which were designed by an online tool (http://crispr. 
mit.edu), were cloned into the sgRNA plasmid, individually. After being treated 
with 1 j1g/ml doxycycline (Sigma, Cat#PHR1789) for 1 week, these cells were 
stained with anti-LILRB4 antibody and the LILRB4 negative cells were sorted as 
LILRB4® cells. For APOE™, SHP1®°, SHP2*° and SHIP® cells, GFP* cells were 
sorted into a 96-well plate as a single cell per well. After cell expansion, knockout 
cells were verified by western blotting. For in vivo induction of CRISPR-Cas9 
to achieve gene knockout, we fed mice with doxycycline as described’. In brief, 
7 days after Cas9/LILRB4-sgRNA-transfected THP-1 cell implantation, mice were 
treated with 2 mg per mouse of doxycycline via gavage daily for 5 days to achieve 
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Cas9 expression in engrafted leukaemia cells. The knockout was validated by flow 
cytometry. 

Leukaemia cell and T cell co-culture assay. In the co-culture assay, human 
T cells (5 x 10* per well) isolated from health donor peripheral blood (PB009-1-0, 
Allcells) were mixed with irradiated (28 Gy) indicated human leukaemia cells in 
a U-bottom 96-well plate. For non-contact co-culture of T cells with leukaemia 
cells, leukaemia cells were cultured in the upper chamber of transwell inserts (pore 
size, 3 uM, #09-761-80, Thermo Fisher) in U-bottom 96 well-plates. T cells isolated 
from healthy donors were placed in the lower chambers of a 96-well transwell plate. 
Irradiated indicated leukaemia cells (E:T ratio = 2:1 if not indicated) were added to 
the upper chambers and treated with indicated antibodies, proteins and reagents. 
After culture with anti-CD3/CD28-coated beads (11161D, Thermo Fisher) and 
50 U/ml rhIL-2 for 5-7 days, representative cells were photographed using 
an inverted microscope, and T cells were stained with anti-CD3 antibodies and 
analysed by flow cytometry. 

For primary AML or B-ALL samples, patient leukaemia cells were sorted as 
CD33* and CD19* for AML and B-ALL, respectively. These leukaemia cells were 
cultured with autologous CD3* T cells from the same patient or allogeneic T cells 
from a health donor (E:T ratio = 2:1). After culture with anti-CD3/CD28-coated 
beads (11161D, Thermo Fisher) and 50 U/ml rhIL-2 for 14 days, representative 
cells were photographed using an inverted microscope, and T cells were stained 
with anti-CD3, anti-CD4 and anti-CD8 antibodies and analysed by flow cytometry. 

For cytotoxicity assay, human CD8* T cells (5 x 10* per well) isolated from 
peripheral blood mononuclear cells (PBMCs) from a healthy donor were stim- 
ulated with anti-CD3/CD28/CD137-coated beads (11163D, Thermo Fisher) for 
2 days in a 96-well plate. Then, indicated 5 x 10° leukaemia cells and 50 to 500 jig/ml 
anti-LILRB4 antibodies or control IgG were added. Cell numbers were deter- 
mined on day 7 in triplicate wells. Alternatively, indicated leukaemia cells in indi- 
cated E:T ratios were cultured with T cells for 4-6 h in triplicate wells. Anti-CD3 
and anti-CD8 were used to detect human CTL cells; indicated live THP-1 cells 
were positive for GFP and negative for PI. Cell supernatants from co-cultures 
of stimulated CTL cells and THP-1 cells treated with anti-LILRB4 or IgG were 
used to examine cytokine production using human cytokine arrays (AAH-CYT-6, 
RayBiotech). 

For co-culture of mouse leukaemia and T cells, spleen cells from wild-type 
C57BL/6 mice were co-cultured with 2.5 x 10* irradiated (28 Gy) mouse leukaemia 
C1498 cells in a U-bottom 96 well-plate for 60 h. Anti-CD3/CD28-coated beads 
(11452D, Thermo Fisher), 50 U/ml recombinant human IL-2, and 5% serum from 
wild-type C57BL/6 mice or from Apoe®° mice were added to the medium. In some 
experiments, 50 jug/ml lipid-bound APOE proteins (APOE-POPC) were added 
to the medium. The lipidation of APOE recombinant protein was conducted as 
described*. 

Transendothelial migration assays. To measure the ability of AML cells to migrate 
through endothelial cells, 3 x 10° HUVECs were cultured on a transwell membrane 
(pore size 8 jum). After 3 days, 1 x 10° indicated leukaemia cells were seeded in 
the upper chamber. In indicated experiments, leukaemia cells were treated with 
antibodies or proteins in the upper chamber. After 18 h, cells in the lower chamber 
were counted. 

Short-term infiltration assay of leukaemia cells and homing assay of haemato- 
poietic stem/progenitor cells. Cells (5 x 10° cells per mouse) were injected intra- 
venously into NSG mice. Animals were treated with 10 mg/kg of anti-LILRB4 
antibodies or control IgG immediately after injection of leukaemia cells. Mice were 
euthanized by CO) asphyxiation and death was assured by cervical dislocation 
after 20 h. Peripheral blood, bone marrow, liver and spleen were collected, and 
single-cell suspensions were examined by flow cytometry. CFSE, GFP or indicated 
markers such as anti-human CD45 and anti-human CD33 were used to detect 
target leukaemia cells in indicated experiments. Numbers of leukaemia cells in 
recipient liver, spleen and bone marrow are reported as a ratio relative to cell num- 
bers in peripheral blood. 

To test the infiltration ability of mouse leukaemia cells, 5 x 10° C1498 GFP 
hLILRB4 cells or C1498 GFP cells were injected intravenously into wild-type 
C57BL/6 or Apoe-null mice. Mice were euthanized after 20 h. GFP was used to 
detect leukaemia cells by flow cytometry. The number of leukaemia cells in recip- 
ient liver, spleen and bone marrow were normalized to numbers in peripheral 
blood, and are reported as a ratio. 

To test the homing ability of haematopoietic stem/progenitor cells (HSPCs), 
1 x 10” human cord blood mononuclear cells were injected intravenously into an 
NSG mouse. Mice were treated with 10 mg/kg of anti-LILRB4 antibodies or control 
IgG immediately after injection of mononuclear cells and were killed after 20 h. 
Anti-human CD45 and anti-human CD34 were used to detect human HSPCs by 
flow cytometry. Similarly, to test the infiltration ability of normal human mono- 
cytes, 5 x 10° CD14-positive selected monocyte from health donor PBMC were 
labelled by CFSE and injected intravenously into an NSG mouse. Mice were treated 
with 10 mg/kg of anti-LILRB4 antibodies or control IgG immediately after injection 
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of monocytes and were euthanized after 20 h. CFSE-positive cells were analysed 
by flow cytometry. 

Innate immune cell depletion. NK cell depletion was done by intraperitoneal 
injection of 50 1l anti-asialo GM1 antibodies (CL8955, Cedarlane) 3 days before 
leukaemia cell implantation, which resulted in >90% depletion of CD45*CD49bt 
NK cells in the circulation of NSG mice. Macrophages were depleted by treat- 
ing NSG mice with clodronate (dichloromethylene bisphosphonate) liposomes 
(SKU8909, Clodrosome) (200 jl of stock solution 3 days before leukaemia cell 
implantation), resulting in >70% depletion of CD45*CD11b* F4/80* macrophages 
in the circulation of NSG mice. NSG mice were rendered neutropenic by intra- 
peritoneal injection of 200j.g anti-Ly-6G mAb (BP0075-1, Bioxcell) on days -3, 
-2, -1, and 0 after leukaemia cell implantation, resulting in >80% depletion of 
CD45.1*CD11b*CD11c" neutrophils in the circulation of NSG mice. 

Human AML xenografts. Xenografts were performed essentially as described”>*”. 
In brief, 6-8-week-old NSG mice were used for transplantation. Human leukaemia 
cells (1 x 10°) were resuspended in 200 jl PBS for each mouse intravenous injec- 
tion. Mice were immediately given 10 mg/kg of anti-LILRB4 antibodies or control 
IgG intravenously. Three to four weeks after transplantation, the peripheral blood, 
bone marrow, spleen and liver were assessed for engraftment. Leukaemia growth 
was monitored over time by luminescence imaging (maximum, 3 x 108 p/sec/cm*/ 
sr; min, 5 x 10° p/sec/ cm2/sr). For survival curve experiments, death was recorded 
when moribund animals were euthanized. For primary PDXs, each NSG mouse 
was given 5-10 x 10° human primary peripheral blood or bone marrow mononu- 
clear cells, which contain leukaemia cells and other normal compartments such as 
normal haematopoietic stem progenitor cells and autologous T cells, via tail-vein 
injection. Mice were immediately given 10 mg/kg of anti-LILRB4 antibodies or 
control IgG intravenously and were treated twice a week until euthanization. For 
AML#11, mice were given 10 mg/kg of anti-LILRB4 antibodies or control IgG 
intravenously 7 days after leukaemia cell implantation and were treated twice a 
week until euthanization. Leukaemia growth was monitored over time by flow 
cytometry of human cells in peripheral blood. The presence of more than 1% of 
human leukaemia cells in mouse tissue was considered successful engraftment of 
primary AML cells. One to four months after transplantation, the peripheral blood, 
bone marrow, spleen, and liver were assessed for engraftment. 

For the human PBMC (hPBMC)-humanized model, 1 x 10” hPBMCs were 
injected intravenously into each NSG mouse. Three weeks after implantation, 
mice had 30 to 50% engraftment of human T cells. At 3 weeks post implantation, 
1 x 10°human AML THP-1 cells, including wild-type or LILRB4®° THP-1 cells or 
THP-1 cells stably expressing luciferase (THP-1-Luc-GFP cells) were subcutane- 
ously implanted. Mice were immediately given 10 mg/kg of anti-LILRB4 antibodies 
or control IgG intravenously and were treated twice a week until euthanization. 
Tumour growth was monitored over time by luminescence imaging (maximum, 
1x 108 p/sec/: cm?/sr; min, 5 x 10° p/: sec/cm?/sr). Tumour sizes were determined 
by caliper measure (width x width x length). For the inducible LILRB4 knockout 
experiment, 1 x 10° Cas9/LILRB4-sgRNA-transfected THP-1 cells were injected 
intravenously into each NSG mouse, immediately followed by intravenous injec- 
tion of 0.5 x 10° isolated human normal T cells from health donors. Seven days 
after THP-1 and T cell implantation, mice were treated with 2 mg per mouse of 
doxycycline via gavage daily for 5 days to achieve Cas9 expression in engrafted 
THP-1 cells. At 3 weeks post implantation, the peripheral blood, bone marrow, 
spleen, and liver were assessed for engraftment. 

For the human cord blood (hCB) xenograft model, 2 x 10 CD34* hCB cells 
were injected intravenously into each NSG mouse. Six weeks after implantation, 
mice had 10 to 50% engraftment of human cells. THP-1 cells (1 x 10°) that stably 
express luciferase were intravenously implanted. Mice were immediately given 
10 mg/kg of anti-LILRB4 antibodies or control IgG intravenously. Tumour growth 
was monitored over time by luminescence imaging (maximum, 1 x 10° p/sec/ 
cm?/sr; min, 5 x 10° p/sec/cm?/sr). Lineages of human normal blood cells were 
analysed by flow cytometry. 

Mouse AML allograft. The procedure for mouse AML allografts was similar to 
that for human AML xenografts. In brief, 6-8-week-old wild-type C57BL/6 mice 
were used for transplantation. Mouse leukaemia cells (1 x 10°) expressing human 
LILRB4 were resuspended in 200 11 PBS for each mouse for intravenous or subcuta- 
neous implantation. Mice were given 10 mg/kg of anti-LILRB4-N297A antibodies 
or control IgG intravenously 7 days after leukaemia cell implantation and were 
treated twice a week until euthanization. Three weeks after transplantation, the 
peripheral blood, bone marrow, spleen and liver were assessed for engraftment. For 
subcutaneously implanted mice, tumour sizes were determined by caliper meas- 
ure (width x width x length). For survival curve experiments, death was recorded 
when moribund animals were euthanized. For CD8* T depletion, 10 mg/kg 
anti-CD8 antibodies (YTS 169.4.2, Bioxcell) were injected intravenously 3 days 
after leukaemia cell implantation and mice were treated for an additional 
2 times every 3 days. To determine whether anti-LILRB4 antibody treatment gen- 
erated tumour-specific memory T cells against the tumour or against LILRB4, 
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we conducted adoptive transfer of spleen cells (5 x 10° cells per mouse) from 
anti-LILRB4 treated mice into normal recipient C57BL/6 mice. Four out of five 
transplanted mice rejected the control C1498-GFP mouse leukaemia cells, and 
these mice were not susceptible to rechallenge with threefold higher numbers 
(3 x 10° cells per mouse) of C1498-GFP leukaemia cells. Of 5 mice that received 
adoptive transfer of spleen cells from naive mice, none rejected the control C1498- 
GFP mouse leukaemia cells. 

Chimeric receptor reporter assay. We constructed a stable chimeric receptor 
reporter cell system as described** to test the ability of a ligand to bind to the ECD 
of individual LILRBs, PirB, gp49B1 and LILRB4 site mutants and to trigger the acti- 
vation or inhibition of the chimerically fused intracellular domain of paired immu- 
noglobulin-like receptor 8, which signals through the adaptor DAP-12 to activate 
the NFAT promoter. If an agonist or antagonist binds the ECD and activates or 
suppresses the chimeric signalling domain, an increase or decrease, respectively, 
in GFP expression is observed. A competition assay was used to screen LILRB4 
blocking antibodies. In brief, APOE proteins (C102, Novoprotein; 10 jg/ml) 
or human AB serum (10%, diluted in PBS) were pre-coated onto 96-well plates 
at 37°C for 3 h. After two washes with PBS, 2 x 104 LILRB4 reporter cells were 
seeded in each well; meanwhile, indicated anti-LILRB4 antibodies were added 
into the culture medium. After 16 h, the percentage of GFP* reporter cells was 
analysed by flow cytometry. The threshold of activation is 2 times that of negative 
control treatment. 

Fast protein liquid chromatography and mass spectrometry. Ten per cent 
human AB serum in PBS was loaded onto a 16/60 Superdex 200 gel filtration 
column and eluted with PBS and 2 mM EDTA. Eighty fractions (40 ml) were 
collected, and each fraction (0.5 ml) was analysed by chimeric receptor reporter 
assay. The active fractions (#26-30) were loaded onto PAGE-gel and processed 
to liquid chromatography with tandem mass spectrometry (LC-MS/MS) anal- 
ysis (Orbitrap Elite) for protein identification in the UTSW proteomics core. 
Recombinant or purified proteins used for validation were ZA2G (MBS145455, 
MyBioSource), AMBP (13141-H08H1, Sino Biological), TTHY (12091-H08H, 
Sino Biological), PEDF (11104-H08H, Sino Biological), A2MG (MBS173010, 
MyBioSource), HEMO (MBS143111, MyBioSource), ANGT (MBS173525, 
MyBioSource), ALAT (MBS173006, MyBioSource), S100A9 (pro-814, Prospecbio), 
HORN (EBP08267, Biotrend USA), VITDB (CSB-EP009306HU, Biotrend USA), 
LRG1 (pro-141, Prospecbio), AIBG (RPE570Hu01, Cloud-Clone), CRSP3 
(RD172262100, BioVendor), APOA1 (16-16-120101-LEL, Athens Research & 
Technology), APOA2 (16-16-120102, Athens Research & Technology), APOA4 
(16-16-120104, Athens Research & Technology), APOB (16-16-120200, Athens 
Research & Technology), APOC] (16-16-120301, Athens Research & Technology), 
APOC2 (16-16-120302, Athens Research & Technology), APOC3 (16-16-120303, 
Athens Research & Technology), hAPOE (16-16-120500, Athens Research & 
Technology), mAPOE (CJ05, Novoprotein), APOE2 (350-12, Peprotech), APOE3 
(350-02, Peprotech), APOE4 (350-04, Peprotech), PODXL2 (1524-EG-050, R&D 
Systems), CD44 (12211-H08H, Sino Biological), HCK (PV6128, Thermo Fisher), 
VEGFR3 (10806-H08H, Sino Biological), NRG3 (16071-H08H, Sino Biological), 
PI16 (H00221476-P01, Novusbio), hMAG (8940-MG-050, R&D Systems), mMAG 
(8580-MG-100, R&D Systems), CNTF (303-CR-050, R&D Systems), ANGPTL-7 
(914-AN-025/CE, R&D Systems), integrin-a.181 (7064-AB-025, R&D Systems), 
integrin-a281 (5698-AB-050, R&D Systems), integrin-«2(3 (7148-AB-025, R&D 
Systems), integrin-0331 (2840-A3-050, R&D Systems), integrin-0481 (5668-A4- 
050, R&D Systems), integrin-a487 (5397-A3-050, R&D Systems), integrin-0581 
(3230-A5-050, R&D Systems), integrin-«583 (3050-AV-050, R&D Systems), 
integrin-05(35 (2528-AV-050, R&D Systems), integrin-«586 (CT039-H2508H, Sino 
Biological), mintegrin-«586 (CT051-M2508H, Sino Biological), integrin-0588 
(4135-AV-050, R&D Systems), integrin-0634 (5497-A6-050, R&D Systems), 
integrin-a881 (CT016-H2508H, Sino Biological), integrin-981 (5438-A9-050, 
R&D Systems), integrin-«1031 (5895-AB-050, R&D Systems), integrin-a1181 
(6357-AB-050, R&D Systems), integrin-aE87 (5850-A3-050, R&D Systems), 
integrin-aX82 (CT017-H2508H, Sino Biological) and normal mouse serum 
(NSO3L, Millipore sigma). 

Bio-layer interferometry. Binding interaction analyses between LILRB4-Fc and 
APOE2, APOE3 or APOE4 were performed on the Octet RED96 (ForteBio, Pall). 
All interaction studies were performed with the protein A dip-and-read biosensors 
(ForteBio). All binding experiments were performed using the Octet Red and 
kinetics buffer at 30°C. LILRB4-Fc coated biosensors (25 j1g/ml LILRB4-Fc was 
loaded for 420 s) were washed in kinetics buffer before monitoring of association 
(300 s) and dissociation (600 s) of APOEs. Background wavelength shifts were 
measured from reference sensors that were loaded only with LILRB4-Fc. 
Surface plasmon resonance. Biacore 2000 and CM5 chips were used to analyse 
binding of recombinant APOEs to the LILRB4 extracellular domain fused to hFc 
as described”. Recombinant protein A (Pierce) was pre-immobilized in two flow 
cells using the amine-coupling kit from GE. LILRB4-hFc was injected into one 
of the flow cells to be captured by protein A. Each binding sensorgram from the 


sample flow cell, containing a captured LILRB4-hFc, was corrected for the protein 
A-coupled cell control. Following each injection of an antigen solution, which 
induced the binding reaction, and the dissociation period during which the run- 
ning buffer was infused, the protein A surface was regenerated by injection of 
regeneration solution containing 10 mM Na3PQO, (pH 2.5) and 500 mM NaCl. All 
captured LILRB4-hFc, with and without APOE bound, was completely removed, 
and another cycle begun. All measurements were performed at 25°C with a flow 
rate of 30 l/min. 

Microscale thermophoresis. MST experiments were performed on a Monolith 
NT.115 system (NanoTemper Technologies) using 80% LED and 20% IR-laser 
power. Laser on and off times were set at 30 s and 5 s, respectively. Recombinant 
LILRB4-ECD protein (SinoBio) was labelled with 4488-NHS (NanoTemper 
Technologies) and applied at a final concentration of 5.9 nM. A twofold dilution 
series was prepared for unlabelled His-APOE (C106, Novoprotein) in PBS, and 
each dilution point was similarly transferred to LILRB4—ECD solution. The final 
concentrations of His~APOE ranged from 0.36 nM to 12 \1M. Samples were filled 
into standard-treated capillaries (NanoTemper Technologies) for measurement. 
Western blotting and co-immunoprecipitation. Whole cells were lysed in 
Laemmli sample buffer (Sigma-Aldrich) supplemented with protease inhibitor 
cocktail (Roche Diagnostics). Samples were separated on SDS-PAGE gels (Bio- 
Rad) and transferred onto nitrocellulose membranes (Bio-Rad) for protein detec- 
tion. Primary antibodies including anti-SHP-1 (Cell Signaling, 3759, 1:1,000), 
anti-phospho-SHP-1 Tyr564 (Cell Signaling, 8849, 1:500), anti-phospho-SHP-1 
Tyr564 (Invitrogen, PA537708, 1:500), anti-SHP-2 (Cell Signaling, 3397, 1:1,000), 
anti-phospho-SHP-2 Tyr580 (Cell Signaling, 3703, 1:500), anti-SHIP1 (Cell 
Signaling, 2727, 1:1,000), anti-phospho-SHIP1 Tyr1020 (Cell Signaling, 3941, 
1:500), anti- NFkB p65 (Cell Signaling, 8242, 1:1,000), anti-IKKa (Cell Signaling, 
11930, 1:1,000), anti-IKKB (Cell Signaling, 8943, 1:1,000), anti-phospho-IK- 
Ka/6 Ser176/180 (Cell Signaling, 2697, 1:500), anti-Ik Ba (Cell Signaling, 4814, 
1:1,000), anti-phospho-Ik Ba Ser32 (Cell Signaling, 2859, 1:500), anti-Lamin-B2 
(Cell Signaling, 12255, 1:1,000), anti-arginase-1 (Cell Signaling, 9819, 1:1,000), 
anti-uPAR (Invitrogen, MON R-4-02, 1:500), anti-LILRB4 (Santa Cruz, sc-366213, 
1:200), anti-APOE (Creative Diagnostics, DCABH-2367, 1:250), anti-B-actin 
(Sigma-Aldrich, A2066, 1:1,000) and anti-c-tubulin (Sigma-Aldrich, MABT205, 
1:1,000), as well as horseradish peroxidase conjugated secondary antibodies (Cell 
Signaling, 7074, 1:1,000, and 7076, 1:1,000) and chemi-luminescent substrate 
(Invitrogen), were used. Specific cellular compartment fractionations were carried 
out using the NE-PER nuclear/cytoplasmic extraction kit (Thermo Fisher, 78833) 
or the plasma membrane protein extraction kit (Abcam, ab65400). Proteins from 
plasma membrane fraction were further incubated with anti-LILRB4 antibodies 
and dynabeads protein A (Thermo Fisher, 10001D) for further immunoprecipi- 
tation and western blotting. 

Immunohistochemistry. Haematoxylin staining and immunostaining were per- 
formed on paraffin sections of tumours. Antibodies used were against LILRB4 (lab- 
oratory produced, 1:100), CD3 (Abcam, ab16669, 1:100), PD-1 (Thermo Fisher, 
J116, 14-9989-82, 1:100) and arginase-1 (Cell Signaling, 9819S, 1:100). The images 
were visualized using the Hamamatsu NanoZoomer 2.0-HT (Meyer instruments) 
and viewed in NPDview2 software (Hamamatsu). 

Cytokine antibody array and arginase activity assay. To examine the secreted 
protein from leukaemia cells, conditioned media were applied to a human cytokine 
antibody array (AAH-CYT-1000, RayBio) for the semiquantitative detection of 
120 human proteins. Image J (NIH) was used for quantification. Arginase 
activity was determined in condition medium of indicated leukaemia cells by a 
QuantiChrom Arginase assay kit (DARG100, BioAssay system). 

RNA-seq analysis. RNA was purified from sorted cells with Qiagen RNeasy 
Mini Kit and then reverse-transcribed with SuperScript III Reverse Transcriptase 
(Invitrogen) according to the manufacturer's instructions. RNA-seq was performed 
at the UTSW Genomics and Microarray Core Facility. The cDNA was sonicated 
using a Covaris S2 ultrasonicator, and libraries were prepared with the KAPA 
High Throughput Library Preparation Kit. Samples were end-repaired, and the 
3/ ends were adenylated and barcoded with multiplex adapters. PCR-amplified 
libraries were purified with AmpureXP beads and validated on the Agilent 2100 
Bioanalyzer. Before being normalized and pooled, samples were quantified by 
Qubit (Invitrogen) and then run on an Illumina Hiseq 2500 instrument using 
PE100 SBS v3 reagents to generate 51-bp single-end reads. Before mapping, reads 
were trimmed to remove low-quality regions in the ends. Trimmed reads were 
mapped to the human genome (HM19) using TopHat v2.0.1227 with the UCSC 
iGenomes GTF file from Illumina. 

Methods for data normalization and analysis are based on the use of ‘inter- 
nal standards’ that characterize some aspects of the system’s behaviour, such as 
technical variability, as presented elsewhere. Genes with log, (fold change) > 2, 
P<0.01 and RPKM >0.1 were deemed to be significantly differentially expressed 
between the two conditions and were used for pathway analysis and upstream 
transcription factor analysis. Pathway analysis was conducted using the DAVID 
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(https://david.ncifcrf.gov/tools.jsp). Upstream transcription-factor analysis was 
conducted using QIAGEN's Ingenuity tool (http://www.ingenuity.com/). 
Molecular docking of LILRB4 with APOE. Docking of LILRB4 with APOE was 
performed on ZDOCKpro module of the Insight II package. The general protocol 
for running ZDOCK includes two consecutive steps of calculation described as 
geometry search and energy search, running in program ZDOCK and RDOCK, 
respectively. LILRB4 crystal structure (3P2T) and APOE3 structure (2L7B) were 
obtained from the Protein Data Bank database. The top 50 ZDOCK poses were 
submitted to RDOCK refinement. Poses with high scores in both ZDOCK and 
RDOCK were selected as candidate complex for LILRB4—APOE interaction anal- 
ysis (Supplementary Table 2). 
Statistical analyses. Representative data from four independent experiments or 
indicated independent samples are presented as dot plots (means + s.e.m.) or 
as box-and-whisker plots (median values (line), 25th-75th percentiles (box out- 
line) and minimum and maximum values (whiskers)). Statistical significance for 
two-sample comparisons was calculated by two-tailed Student's t-test. Statistical 
significance for survival was calculated by the log-rank test. The multivariate 
analysis of TCGA data was analysed by Cox regression. The difference was con- 
sidered statistically significant if P < 0.05. NS, not significant; exact P values are 
shown. Pearson's correlation analyses were performed with RStudio software (the 
R Foundation). 
Code availability. The custom code for Pearson’s correlation analysis in RStudio 
is shown below. 

setwd(“~/file paths”) 

draw.graph = function(sam) { 

file = paste(sam, “tsv’, sep =”) 

dfl = read.table(“file name A.tsv’, sep = “\t’, header = T) 

df2 = read.table(file, sep = “\t? header = T) 

df = merge(df1, df2, by.x = “sample’, by-y = “sample”) 

df = df[!is.na(df[,2]),] 

r =cor.test(df[,2], df[,3]) 

P=cor.test(df[,2], df[,3])$p.value 

jpeg(filename = paste(sam;?jpeg’, sep = “"), 

width = 280, height = 280, units = “px’, pointsize = 12, 

quality = 75) 

plot(df[[2]], df[[3]], xlab = “Title”, ylab = sam, 

pch = 16, cex = 1, col = “red”, 

main = paste(“r = Sround(r$estimate,2), “\n’? P = “round(p,8), sep = “")) 


reg = Im(df[[2]]~df[[3]]) 
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abline(reg) 
dev.off() 
} 


sample = c(“file name B”) 
for (s in sample) { 
draw.graph(s) 

} 


Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability 

The TCGA datasets analysed are available in the UCSC Xena Browser (https:// 
xena.ucsc.edu). The RNA-seq datasets generated in the current study have been 
deposited in NCBI SRA database with the SRA accession number SRP155049. 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | LILRB4 expression in patients with AML 
negatively correlated with overall survival and T cell proliferation. 

a, Analysis of correlation between mRNA levels of immune-modulating 
molecules and the overall survival of patients with AML (n = 160, divided 
into three groups based on gene expression) in TCGA database (https:// 
xena.ucsc.edu) by Kaplan-Meier long-rank test. b, Kaplan-Meier analysis 
of correlations between LILRB4 mRNA level and the overall survival of 
patients (n = 160) from the TCGA database, performed in Xena browser 
(https://xena.ucsc.edu). Low, n = 57; medium, n = 48; high, n = 55. 

The P value was from Kaplan-Meier log-rank test. c, mRNA expression 
data from the TCGA database were analysed as a function of the AML 
subtype of the patient. MO, n = 16; M1, n = 42; M2, n = 39; M3, n= 16; 
M4, n = 35; M5, n = 18; M6, n = 2; M7, n = 3; and two not-classified 
AML samples. Pairwise comparisons between M4 and each one of the 
other subtypes (all P < 0.0001), as well as between M5 and each one of 
the other subtypes (all P < 0.0001), using two-sample t-test. Mean and 
s.e.m. values are shown. d, A multivariable Cox regression analysis to 
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assess the association, with adjustment for confounders that include age, 
cytogenetics and PML-RAR mutation in TCGA database. The total sample 
size was 79. *P < 0.05 is considered significant. e, f, Autologous T cells 
isolated from individual patients with monocytic AML or B-ALL were 
incubated with irradiated LILRB4-positive or LILRB4-negative primary 
leukaemia cells from the same patients. pT, patient T cells. Allogeneic 

T cells isolated from healthy donors were incubated with irradiated 
LILRB4-positive or LILRB4-negative primary leukaemia cells from 
indicated patients with AML or B-ALL at an E:T of 10:1. nT, normal 

T cells. After culture with anti-CD3/CD28/CD137-coated beads and 
rhIL-2 for 14 days, T cells were stained with anti-CD3, anti-CD4, and 
anti-CD8 antibodies and analysed by flow cytometry. e, f, P values from 
two-tailed Student’s t-test. P values in black indicate significance of 
CD3*CD8* cells; P values in red indicate significance of CD3*CD4* cells. 
n = 2 or 3 biologically independent samples with mean and s.e.m. See raw 
data for e and f in Source Data. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | LILRB4 suppresses T cell proliferation in vitro. 
a, Schematic of preparation of LILRB4-modulated THP-1 cells and 
examination of LILRB4 expression on the cell surfaces by flow cytometry. 
WT, THP-1 cells treated with scrambled control; LILRB4®°, LILRB4- 
knockout THP-1 cells; LILRB4*°-wt, forced expression of wild-type 
LILRB4 in LILRB4*° THP-1 cells; LILRB4*°-intA, forced expression of 
intracellular domain-deleted mutant LILRB4 in LILRB4®° THP-1 cells. 

b, Loss of LILRB4 on THP-1 cells reduces T cell suppression. 
Representative photograph of Fig. 1c (scale bar, 100 zm). ¢, Loss of 
LILRB4 on THP-1 cells does not affect cell proliferation (n = 3 biologically 
independent samples with mean and s.e.m.). d, Examination of LILRB4 
expression on cell surface of LILRB4*° MV4-11 cells by flow cytometry. 

e, f, Loss of LILRB4 on MV4-11 cells reduces T cell suppression. T cells 
isolated from healthy donors incubated in the lower chambers of a 96- 
well transwell plate with irradiated MV4-11 cells (E:T of 2:1) in the upper 
chamber separated by a membrane with 3-\1m pores. After culture with 
anti-CD3/CD28-coated beads and rhIL-2 for 7 days, representative cells 
were photographed using an inverted microscope (scale bar, 100 1m) (e) 
and T cells were stained with anti-CD3 and analysed by flow cytometry (f). 
n = 4 biologically independent samples. g, Loss of LILRB4 on MV4-11 
cells does not affect cell proliferation (n = 3 biologically independent 
samples with mean and s.e.m.). h, i, T cells (E, effector cells) isolated from 
healthy donors were incubated with indicated irradiated THP-1 cells 

(T, target cells) without direct contact in transwells for 2 days. E:T = 2:1. 
T cells were treated with BrdU for 30 min followed by BrdU and 7-AAD 
staining for flow cytometry analysis. Representative flow cytometry plots 
are shown in h and the cell cycle status is summarized in i. T control, T cells 
cultured without THP-1 cells. n = 3 biologically independent samples 
with mean and s.e.m. j, k, T cells (E, effector cells) isolated from healthy 
donors were stained with CFSE and incubated with indicated irradiated 
THP-1 cells (T, target cells) without direct contact in transwells for 2 days. 
A representative flow cytometry plot is shown in j and the percentages 

of proliferating T cells indicated by CFSE-low staining is shown in k. 

n = 3 biologically independent samples with mean and s.e.m. 1, LILRB4 
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increases PD-1 expression on T cells in coculture of leukaemia cells and 

T cells. T cells (E, effector cells) isolated from healthy donors were 
incubated with indicated irradiated THP-1 cells (T, target cells) ina 
non-contact manner for 5 days. E:T = 2:1. T cells were stained with anti- 
LAG-3, anti-TIM-3, anti-TIGIT, anti-PD-1 and anti-FasL antibodies for 
flow cytometry analysis. Representative flow cytometry plots and the mean 
of fluorescence intensities, at the right-upper corner (black, WT; red, KO), 
are shown. Experiments were performed three times with similar results. 
m, n, Anti-LILRB4 antibody had no effect on proliferation of THP-1 cells (m) 
or T cells (n). m, The growth of THP-1 cells during 7 days treatment with 
IgG or anti-LILRB4 antibody (n = 3 biologically independent samples with 
mean and s.e.m.). n, The numbers of human primary T cells after 5 days 
treatment with IgG or anti-LILRB4 antibody in vitro (n = 3 biologically 
independent samples with mean and s.e.m.). 0, p, Primary T cells and 
irradiated THP-1 cells (E:T ratio, 2:1) were placed in the lower and upper 
chamber, respectively, and treated with 10 j1g ml control IgG or anti- 
LILRB4 antibodies. 0, Representative photographs of T cells (scale bar, 

100 1m). p, T cells stained with anti-CD3 and analysed by flow cytometry. 
n= 4 biologically independent samples. q, Primary T cells stimulated with 
anti-CD3/CD28/CD137-coated beads were co-cultured with wild-type or 
LILRB4*° THP-1 cells with indicated E:T ratios for 4h (n = 3 biologically 
independent samples with mean and s.e.m.). Cytotoxity of leukaemia cells 
was determined by PI staining in flow cytometry analysis. r-u, CD8* 

T cells (5 x 104 cells) stimulated with anti-CD3/CD28/CD137-coated 
beads were co-cultured with 5 x 10? THP-1 cells that stably express GFP 
and treated with 100 jig ml“! anti-LILRB4 antibodies or control IgG for 

5 days. s, t, n = 4 biologically independent samples; u, n = 3 biologically 
independent samples with mean and s.e.m. Representative flow plots (r) 
of the percentages of T cells (GFP”) and surviving leukaemia cells (GFP*), 
and quantification of T cells (s), GFP+ leukaemia cells (t), and secretion of 
IFN7 (u), are shown. b, d, e, h, j, 0, r, Experiments repeated independently 
three times with similar results. See Methods for definition of box plot 
elements in f, p, s, t. All P values were from two-tailed Student's t-test. 
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Extended Data Fig. 3 | Inhibition of LILRB4 reduces leukaemia 
development in humanized immunocompromised mice and syngeneic 
mice. a-c, Wild-type or LILRB4*° THP-1 cells (3 x 10° cells per mouse) 
were subcutaneously implanted into hPBMC-repopulated NSG mice 
(WT, n = 14 mice with mean and s.e.m.; LILRB4*®°, n = 10 mice, mean 
and s.e.m.; also see Source Data). Tumour size (a), quantification of CD3T 
cells at day 31 in peripheral blood of recipient mice (b) and representative 
flow plots showing CD4* and CD8* T cells (c) are shown. d, e, LILRB4 
increases PD-1 expression on tumour-infiltrated T cells. Wild-type or 
LILRB4*° THP-1 cells were subcutaneously implanted into hPBMC- 
repopulated NSG mice. Three weeks after implantation, 7 out of 10 WT 
group mice had large tumours and 3 out of 10 knockout-group mice had 
tiny tumours. These tumours were dissected for immunohistochemistry 
and flow cytometry staining with anti-LILRB4, anti-CD3, anti-PD-1 and 
anti-ARG1 antibodies. Left corner images are magnified from yellow 
highlighted regions. In CD3 and PD-1 staining images, orange dashed 
lines indicate the tumour boundary. Black arrowheads indicate PD-1 
positive cells. Scale bar, 100 um. e, Tumours were dissected and cells in 
tumour region were stained with anti-CD3 and anti-PD-1 antibodies 

for flow cytometry analysis. The percentages of PD-1* T cells (ratio of 
PD-1*CD3* cells to CD3* cells) were calculated. f-i, THP-1 cells were 
transplanted into hPBMC-repopulated NSG mice, and mice were treated 
with control IgG or anti-LILRB4 antibodies after 6 days (10 mg kg”; 

n= 5). Leukaemia development was monitored by luminescence imaging (f); 
luminescence flux (radiance) at day 26 (g; n = 5) and T cell numbers 

at day 26 in representative mice (h, i). j, k, Engraftment of human 

T cells and intravenously transplanted Dox-inducible LILRB4-knockout 
THP-1 cells (GFP*) in NSG mice at day 7 before Dox administration 

(n = 5). 1, Representative flow plot shows that LILRB4 was successfully 
deleted in engrafted leukaemia cells in bone marrow of Dox-fed mouse 

at the endpoint. n.s., not significant. m-w, Mouse AML C1498 cells 

(3 x 10° cells per mouse) that stably express LILRB4-IRES-GFP were 
subcutaneously implanted into C57BL/6 mice. Anti-LILRB4-N297A 
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antibodies or control IgG were intravenously injected at 6, 9, 12, 15, 18 
and 21 days after implantation of tumour cells. Two groups of mice were 
treated with anti-CD8 antibodies at 3, 6, 9 and 12 days after implantation 
of tumour cells to achieve CD8* T cell depletion. m, Tumour growth 

of subcutaneously implanted human LILRB4-expressing mouse AML 
C1498 cells (hLILRB4 C1498) in C57BL/6 mice with anti-LILRB4-N297A 
antibodies or control antibody treatment (n = 5 mice). Also see Source 
Data. n, Survival curve of subcutaneous hLILRB4 C1498 tumour-bearing 
mice (n = 12 mice). As for tumour size, anti-LILRB4 antibodies decreased 
the tumour weight (0, n = 5 mice) but did not do so in the absence of 
CD8* T cells (p, 2 = 5 mice). The percentage of CD8* T cells in spleen 
was significantly negatively correlated with tumour size (q, n = 5 mice) 
but not in the absence of CD8* T cells (r, 1 = 5 mice). s, Adoptive 
transplantation of spleen cells from control mice or tumour-bearing 

mice that were cured by anti-LILRB4-N297A treatment (n = 5 mice). 
Tumour size was monitored as a function of time. Arrow indicates day of 
rechallenge in mice that had eliminated leukaemia with three times the 
number of AML cells (n = 4 mice). Also see Source Data. Anti-LILRB4 
antibodies reduced the infiltration of leukaemia cells into host tissues 
(t-v, n = 5 mice) and even CD8* cells were depleted (w, n = 5 mice). 
x-z, C57BL/6 mice were intravenously injected with human LILRB4- 
expressing mouse AML C1498 cells (3 x 10° cells per mouse) that 
expressed GFP. Anti-LILRB4-N297A antibodies (n = 9 mice) or control 
IgG (n = 9 mice) were intravenously injected at 6, 9, 12, 15 and 18 days 
after implantation of tumour cells. Anti-LILRB4 antibodies decreased 

the percentage of leukaemia cells in bone marrow (x). Anti-LILRB4 
antibodies increased CD8* T cells (y). The percentage of CD8* T cells in 
bone marrow was significantly negatively correlated with the percentage 
of leukaemia cells (z). c, i, 1, Experiments repeated independently three 
times with similar results. See Methods for definition of box plot elements 
in b, e, g, h, j, k, 0, p, t-y. All P values (except n, long-rank test; and q, r, z, 
Pearson’s correlation) from two-tailed Student’s t-test. 
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Extended Data Fig. 4 | Anti-LILRB4 antibodies reduce leukaemia 
development by restoring autologous T cells in PDX mice and 
inhibiting primary AML cell infiltration. a, Primary peripheral blood or 
bone marrow mononuclear AML cells (5 x 10°to 1 x 107 cells per mouse) 
from each of sixteen human patients (three shown in Fig. 1g-i, also 

see Supplementary Table 5) were injected into NSG mice followed by 
treatment with IgG or anti-LILRB4 antibodies (10 mg kg! twice a week by 
intravenous injection). Percentages of human CD45*LILRB4* AML cells 
collected from haematopoietic tissues including bone marrow, spleen, liver 
and peripheral blood 2-4 months after transplantation, as determined by 
flow cytometry, are shown. b, Percentages of autologous human T cells 
collected from haematopoietic tissues including bone marrow, spleen, liver 
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and peripheral blood 2-4 months after transplantation, as determined 
by flow cytometry; and representative flow plots of CD3*CD8* T cells 
in bone marrow of mice in three PDXs. n = 8 biologically independent 
samples for all PDXs except AML#11 (n = 20 biologically independent 
samples) in a, b. c-e, Comparison of infiltration of human primary 
monocytic AML cells in NSG mice (n = 5 mice) after treatment with 
anti-LILRB4 antibody or IgG control. c, d, Primary human peripheral 
blood mononuclear cells from patients with monocytic AML were 
injected. The quantifications in c are also shown in Fig. 21-n. e, Mouse 
liver cells with xenografted primary human monocytic AML cells (human 
CD45*LILRB4* cells) were injected. See Methods for definition of box 
plot elements in a-e. All P values from two-tailed Student's t-test. 
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Extended Data Fig. 5 | LILRB4 promotes infiltration of AML cells. 

a, c, LILRB4 expression on mouse C1498 (a) or WEHI-3 (c) AML cells 
that stably express Lilrb4 (also known as Lilrb4a). b, d, Forced 

expression of LILRB4 did not affect proliferation of mouse C1498 

(b, n = 3 biologically independent samples with mean and s.e.m.) or 
WEHI-3 (d, n = 3 biologically independent samples with mean and 
s.e.m.) AML cells. e, Forced expression of human LILRB4 promoted 
transendothelial migration of mouse AML WEHI-3 cells (n = 3 
biologically independent samples with mean and s.e.m.). f, NSG mice 

(n = 6 mice) were injected with 1 x 10° THP-1 cells followed immediately 
by IgG or anti-LILRB4 antibody treatment and were monitored by 
bioluminescence imaging. g, h, Anti-LILRB4 antibodies decreased AML 
cell infiltration into internal organs. Mice were killed at 21 days for ex 
vivo bioluminescence imaging of internal organs after transplantation of 
1 x 10° luciferase-expressing THP-1 cells. Images of luminescence flux 
(radiance) from representative mice are shown (g). 1, GI tract; 2, legs; 3, 
lung; 4, spleen; 5, liver; 6, kidneys; 7, brain; 8, heart. Infiltrated leukaemia 
cells formed tumour nodules in liver (h). i, j, Anti- LILRB4 antibodies did 
not affect LILRB4-negative cancer cells. LILRB4 is expressed on THP-1 
and MV4-11 human AML cells but not on U937 cells as analysed by flow 
cytometry (i). Isotype IgG was used as control. NSG mice were injected 
with U937 human AML cells, which do not express LILRB4, and then 
treated with anti-LILRB4 antibodies (j). IgG served as control antibodies. 
Mice were killed at day 25 post-transplant for analysis of LV, BM, SP and 
PB by flow cytometry. The presence of human AML cells was detected by 
anti-human CD45 antibody staining (m = 4 mice with mean and s.e.m.). 
k-t, Anti-LILRB4 antibodies decreased infiltration of THP-1 (k-o) 

or MV4-11 (p-t) human AML cells. Comparison of transendothelial 
migration abilities of GFP-expressing THP-1 (k) or CFSE-labelled 
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MV4-11 (p) cells after treatment with anti-LILRB4 antibody or IgG 
control in a transwell assay (n = 3 biologically independent samples with 
mean and s.e.m.). Comparison of the homing abilities of GFP-expressing 
THP-1 or CFSE-labelled MV4-11 cells (5 x 10° per mouse) that were 
injected into NSG mice followed immediately by IgG or anti-LILRB4 
antibody treatment. Numbers of leukaemia cells (GFP* in 1, CFSE* in q) 
in LV, SP and BM normalized to that in PB as determined by flow 
cytometry 20 h after injection (n = 5 mice). NSG mice were injected with 
1 x 10° THP-1 or MV4-11 cells followed immediately by IgG or anti- 
LILRB4 antibody treatment (mn = 6 mice for THP-1 or 5 mice for MV4-11 
xenografts). Percentages of MV4-11 cells (stained with anti-human CD45) 
as determined by flow cytometry in indicated organs at day 21 post- 
transplant (m, r), overall survival (n, s) and body weights as a function 

of time (0, t) are shown. u, Targeted immune cell populations were 
depleted in NSG mice. Representative flow cytometry plots demonstrating 
successful reduction of NK cell (CD45+CD49b*), macrophage 
(CD11b*F4/80*), and neutrophil (CD11b*CD11c’) frequency in NSG 
mice depleted of the respective immune cell subtype by treatment 

with anti-asialo GM1 antibodies, clodronate liposomes and anti-Ly6G 
antibodies, respectively, compared to non-depleted (wild-type) NSG mice. 
v, CFSE-labelled MV4-11 cells (5 x 10° per mouse) were injected into 
NSG mice in which the respective innate immune cells were depleted, 
followed immediately by IgG or anti-LILRB4-N297A antibody treatment 
(n = 5 mice). Numbers of leukaemia cells (CFSE positive) in LV, SP and 
BM normalized to that in PB at 20 h post-injection. a, c, i, u, Experiments 
repeated independently three times with similar results. See Methods for 
definition of box plot elements in], m, 0, q, r, t, v. All P values (except n, s, 
log-rank test) are from two-tailed Student’s t-test. 
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Extended Data Fig. 6 | APOE induces LILRB4 activation to suppress 

T cells and support AML cell migration in vitro. a, Schematic of the 
LILRB4 reporter system. b, Human and mouse integrin heterodimer 
proteins cannot activate the LILRB4 reporter (n = 3 biologically 
independent samples with mean and s.e.m.). Human and mouse sera 

were used as positive controls. The threshold of activation is twice that 

of negative control treatment. c, Flow cytometry demonstrating that 
anti-LILRB4 antibody binds to human LILRB4 reporter cells. d, LILRB4 
activation as indicated by percentage of GFP* cells in the presence and 
absence of 10% human serum (HS) with or without anti-LILRB4 antibody 
or control IgG (n = 3 biologically independent samples with mean and 
s.e.m.). e, Flow chart of ligand identification of potential ligands of LILRB4 
in human serum. f, Fractionation of LILRB4-stimulating activities from 
human serum by fast protein liquid chromatography. The positive control 
was 10% human serum. g, Proteins identified from the LILRB4 stimulating 
fractions by mass spectrometry. PSMs, peptide spectrum matches. h, Both 
human and mouse APOE proteins can activate LILRB4 reporter (nm = 3 
biologically independent samples with mean and s.e.m.). Human and 
mouse sera were used as positive controls. The threshold of activation is 
twice that of negative control treatment. i, APOE proteins from different 
sources all activate LILRB4. APOE (20 jg ml“) purified from human 
plasma, His-tagged or tag-free recombinant human APOE (rhAPOE) 

(20 jug ml~!) expressed by 293T mammalian cells, or rhAPOE (20 jg ml’) 
expressed by bacteria all activate the LILRB4 reporter. These APOE all 
represent human APOE3 (n = 3 biologically independent samples with 
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mean and s.e.m.). j, APOE2, APOE3 and APOE4 all activate the LILRB4 
reporter (n = 3 biologically independent samples with mean and s.e.m.). 
Forty micrograms per millilitre APOEs were coated on plates or directly 
added in cell culture medium (soluble). k, 1, Three APOE isoforms bind 
to human LILRB4. k, Binding kinetics of APOE2, 3 and 4 to LILRB4-Fc 
were measured using SPR. LILRB4-Fc was immobilized on protein A 
biosensor tips and incubated with APOE concentrations ranging from 
1.5625 nM to 100 nM. I, Binding kinetics of APOE2, 3, and 4 to LILRB4-Fc 
were measured using bio-layer interferometry (Octet). LILRB4-Fc was 
immobilized on protein A biosensor tips and incubated with APOE 
concentrations ranging from 44 nM to 1,176 nM. m, As shown in Fig. 3h, 
mutation of W106 and Y121, located in the first immunoglobulin domain 
and in the linker between two immunoglobulin domains, respectively, 
significantly reduced activation of LILRB4 by APOE. n, p, Examination 
of APOE expression in APOE-knockout THP-1 and MV4-11 cells by 
immunoblots. Primary T cells and irradiated THP-1 or MV4-11 cells 

(E:T = 2:1) were incubated in the lower and upper chambers, respectively. 
T cells were photographed (0, q, scale bar, 100 j.m) and quantified by flow 
cytometry (Fig. 3i and r, n = 4 biologically independent samples) after 

7 days. s, t, Loss of APOE suppresses transendothelial migration of human 
AML THP-1 and MV4-11 cells (n = 4 biologically independent samples 
with mean and s.e.m.). ¢, k, 1, n-q, These experiments were repeated 
independently three times with similar results. See Methods for definition 
of box plot elements in r. All P values are from two-tailed Student’s t-test. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | LILRB4 upregulates phosphorylation of SHP-2, 
NF«B signalling and expression of uPAR and ARGI to suppress T cell 
activity and support leukaemia migration. a, Phosphorylated SHP-2 and 
Ik Ba were downregulated upon LILRB4 knockout in MV4-11 cells. 

b, Co-immunoprecipitation demonstrated that LILRB4 interacts with 
SHP-2 in THP-1 cells. c, SHP1, SHP2 and SHIP were individually 
knocked out by CRISP-Cas9 in THP-1 cells as detected by western 
blotting. d, Primary T cells and irradiated THP-1 cells (E:T = 2:1) were 
cultured in the lower and upper chambers, respectively. T cells were 
photographed (scale bar, 100 1m) after 7 days. e, f, Two different NFkB 
inhibitors restored T cell proliferation from the suppression by THP-1 
cells in an LILRB4-dependent manner (n = 4 biologically independent 
samples). THP-1 cells were pretreated with various doses of NFkB 
inhibitors for 1 h. Primary T cells and irradiated pretreated THP-1 cells 
(E:T = 2:1) were cultured in the lower and upper chambers, respectively. 
T cells were photographed (e, scale bar, 100 1m) and analysed by flow 
cytometry (f) after 7 days. g, h, Loss of LILRB4 decreased secreted protein 
production in THP-1 cells as determined by a human cytokine antibody 
array (g) and the blot intensities were quantified by Image] software 

(h, n = 3 biologically independent samples with mean and s.e.m.). Red 
boxes indicate proteins that were changed upon LILRB4 knockout; blue 
boxes indicate positive controls. i, Surface uPAR was downregulated 

in LILRB4*° THP-1 and MV4-11 AML cells. j, T cells were incubated 
with irradiated indicated THP-1 cells supplemented with indicated 
concentrations of recombinant uPAR proteins for 7 days. T cells were 
photographed. k, T cells isolated from healthy donors were cultured 

with anti-CD3/CD28-coated beads and rhIL-2 and supplemented with 
indicated concentrations of uPAR proteins for 3 days (n = 4 biologically 
independent samples). Representative cells were photographed using an 
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inverted microscope and T cells were analysed by flow cytometry. 

1, Expression of uPAR and ARG1 is downregulated in in LILRB4 knockout 
THP-1 and MV4-11 AML cells. m, Arginase activity as determined by 

a colorimetric method (DARG100, BioAssay system) was decreased 

in condition medium of LILRB4*° THP-1 and MV4-11 cells (n = 3 
biologically independent samples with mean and s.e.m.). n, Primary 

T cells and irradiated indicated THP-1 cells (E:T = 2:1) were incubated 

in the lower and upper chambers, respectively, and were supplemented 
with 0.002 U/l recombinant ARGI proteins for 7 days. T cells were 
photographed. 0, T cells isolated from healthy donors were cultured 

with anti-CD3/CD28-coated beads and rhIL-2 and supplemented with 
indicated concentrations of ARGI proteins for 3 days (n = 4 biologically 
independent samples). Representative cells were photographed using 

an inverted microscope and T cells were analysed by flow cytometry. 

p, Autologous T cells isolated from individual patients with monocytic 
AML were incubated with irradiated LILRB4-positive or LILRB4- 
negative primary leukaemia cells from the same patients at an E:T of 

10:1, supplemented with recombinant anti-LILRB4 antibodies, APOE- 
VLDL, uPAR or ARGI. pT, patient T cells. After culture with anti-CD3/ 
CD28/CD137-coated beads and rhIL-2 for 14 days, T cells were stained 
with anti-CD3, anti-CD4, and anti-CD8 antibodies and analysed by flow 
cytometry. n = 3 biologically independent samples with mean and s.e.m. 
q; Supplementation of recombinant uPAR or ARGI1 to the medium rescued 
the decrease in transmigration ability of LILRB4*° THP-1 or LILRB4*° 
MV4-11 cells across endothelium (n = 3 biologically independent samples 
with mean and s.e.m.). Scale bar, 100 jum. a-e, g, i, j, 1, n, Experiments 
repeated independently three times with similar results. See Methods 

for definition of box plot elements in f, k, 0. All P values from two-tailed 
Student’s t-test. See raw data for p in Source Data. 
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Extended Data Fig. 8 | Detection of SHP-2-NFKB signalling and 
uPAR and ARGI expression in primary human monocytic AML 
cells. a, LILRB4-positive or -high CD33+ AML cells (red box) and 


LILRB4-negative or -low CD33+ AML cells (blue box) were gated for 
further intracellular staining of SHP-2 phosphorylated at Y580, IKKa/B 
phosphorylated at $176/S180, NFkB phosphorylated at $529, uPAR and 
ARGL. Isotype IgG was used as negative control. Red numbers indicate 
MFI of LILRB4-positive or -high CD33* AML cells; blue numbers indicate 


uPAR 
LILRB4 


3 
Plasma membrane 


T suppression 


= 


Infiltration 


gq —---= 


‘se Arginase-1 


MFIs of LILRB4-negative or -low CD33* AML cells. This experiment was 
repeated with eight individual patient samples with similar results. 

b, Quantification of individual staining in LILRB4-positive or -high 
CD33* AML cells versus in LILRB4-negative or -low CD33+ AML 

cells. n = 8 independent patients; see Methods for definition of box plot 
elements. P values from two-tailed Student’s t-test. c, Schematic for the 
mechanisms by which LILRB4 suppresses T cells and promotes leukaemia 
infiltration. 
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Extended Data Fig. 9 | See next page for caption. 


© 2018 Springer Nature Limited. All rights reserved. 


Monocyte-derived Macrophage 


me Control 
@8 20 pg/ml APOE 


LETTER 


Extended Data Fig. 9 | Comparison of LILRB4-mediated intracellular 
signalling in leukaemia cells and in normal haematopoietic cells. 

a, Comparison of LILRB4 surface expression on normal monocytes 
from healthy donors (n = 25 individual donors with mean and s.e.m.) 
and neoplastic monocytic cells from patients with AML (n = 53 
individual patients with mean and s.e.m.). b, Comparison of LILRB4 
surface expression on normal monocytes from two healthy donors 

and on wild-type and LILRB4®° THP-1 cells. This experiment was 
repeated independently three times with similar results. c, Anti-LILRB4 
antibody did not affect homing ability of normal monocytes. Human 
normal monocytes (b) selected through CD14-positive selection. These 
isolated monocytes were pooled and stained by CFSE. After staining, 
monocytes (5 x 10° for each mouse) were injected into NSG mice 
followed immediately by antibody treatment, and then the mice (n = 4 
mice, see Methods for definition of box plot elements) were killed 20 h 
after transplant. The number of CFSE* cells in liver, spleen and bone 
marrow were normalized to that in peripheral blood as determined by 
flow cytometry. d, e, APOE activates LILRB4 intracellular signalling 

in leukaemia cells. Indicated THP-1 cells and primary AML (M5) cells 


were serum-starved overnight and then treated with the indicated 
concentration of human recombinant APOE protein for the indicated 
time. Phospho-SHP-2, phospho-NFKB and ARG1 were examined by 
western blotting. f, The effect of APOE on normal monocytes or in vitro 
differentiated macrophages. Normal monocytes were isolated from healthy 
donors and macrophages were derived from these monocytes after one 
week of differentiation in vitro. Cells were serum-starved overnight and 
then treated with the indicated concentrations of human recombinant 
APOE protein for the indicated times. Phospho-SHP-2, phospho-NFKB 
and ARGI were examined by western blotting. g, APOE induces uPAR 
upregulation on AML cells. Normal monocytes were isolated from healthy 
donors. Indicated primary AML cells and normal monocytes were serum- 
starved overnight and then treated with 20 pg ml“! human recombinant 
APOE protein for eight hours. Surface uPAR was examined by flow 
cytometry. Representative flow plots are shown and MFIs are shown in 
top right corner (black, PBS control; red, APOE treatment). Experiments 
were performed three times with similar results. P values from two-tailed 
Student's t-test. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | Anti-LILRB4 does not affect engraftment of 
normal haematopoietic cells. a, LILRB4 and CD34 co-staining patterns 
for representative samples of human cord blood mononuclear cells 
(hCB MNCs). N/G, neutrophils and granulocytes; M/D, monocytes, 
macrophages and dendritic cells; L/P, lymphocytes, haematopoietic 
stem and progenitor cells. This experiment was repeated independently 
three times with similar results. b, Anti-LILRB4 antibody did not affect 
homing ability of normal haematopoietic progenitor cells. hCB MNCs 
(1 x 107) were injected into NSG mice followed immediately by antibody 
treatment, and then the mice (n = 3 mice with mean and s.e.m.) were 
killed 20 h after transplant. The number of CD45*CD34* HSCs in liver, 
spleen, and bone marrow were normalized to that in peripheral blood as 


determined by flow cytometry. c—e, Anti-LILRB4 antibodies inhibited 
leukaemia development in hCB-humanized NSG mice. c, Schematic of 
the experiment to test whether anti-LILRB4 antibody inhibits leukaemia 
development in hCB-humanized NSG mice. d, Leukaemia development 
was monitored over time by luminescence imaging. This experiment 

was repeated independently twice with similar results. e, Frequency 

of engrafted leukaemia, normal human cells, including human B cells, 
human myeloid cells and human T cells in peripheral blood over time and 
haematopoietic tissues of hCB-humanized mice at 24 days after leukaemia 
transplantation. n = 3 mice with mean and s.e.m. All P values from two- 
tailed Student's t-test. 
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Paths less travelled 


For many, acareer in academia is the goal, but other options could prove more rewarding. 


BY CHRIS WOOLSTON 


any science students and junior 
researchers continue to aspire to a 
career in academia, a dream that 


has persisted for generations. But Nature’s 
biennial survey of salary and job satisfaction 
in the global science community underscores 
an important reality: there is a vast number 
of career opportunities for scientists beyond 
academic research, and some of those options 
might be more rewarding, whether emotion- 
ally, financially or both. 

Nature’s survey — for which fieldwork was 
conducted between June and July 2018 by 
Shift Learning, a London-based research con- 
sultancy — drew responses from 6,413 self- 
selected readers from around the world. 


(Responses from people who hadr’t gone 
beyond an undergraduate degree were filtered 
out, leaving a sample of 4,334.) Nearly 40% of 
respondents live in North America, 35% are 
in Europe and 16% are in Asia. Nature also 
heard from researchers in Australasia, Africa 
and South America. 

The survey asked about salaries, job satis- 
faction, work-life balance, encounters with 
discrimination, mental health and other key 
issues that can define and shape a scientific 
career. The results, along with follow-up inter- 
views with selected respondents, captured the 
diversity of the scientific experience, from the 
struggles to the triumphs. 

More than two-thirds (68%) of respond- 
ents said that they were satisfied or very sat- 
isfied with their careers, a rate that is largely 


unchanged from the 2016 survey. Still, there's 
no guarantee that those numbers will stay sta- 
ble. Thirty-seven per cent of respondents said 
that their satisfaction had worsened in the past 
year, and just 32% said that it had improved. 
A break-down of responses by employment 
sector showed how attitudes vary across the 
wide spectrum of scientific paths. Respond- 
ents working for non-profit organizations 
were especially likely to feel satisfied with their 
jobs (73%), followed closely by respondents 
in industry (71%), government (68%) and 
academia (67%). “This supports the fact that 
there are very fulfilling, high-paying jobs 
outside of academia,’ says Susan Porter, dean 
and vice-provost of graduate and postdoctoral 
studies at the University of British Columbia 
in Vancouver, Canada. > 
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> — Satisfaction numbers have shifted since 
the 2016 survey, which found a slightly higher 
proportion of satisfied scientists in academia 
(65%) than in industry (63%). The differences 
between the two surveys suggest that the bal- 
ance between academia and industry has 
slightly tilted towards companies. 

High levels of job satisfaction among 
researchers have been documented by other 
surveys, including one earlier this year of PhD 
holders by the University of British Colum- 
bia and one in 2016 of European researchers 
conducted by Vitae, a non-profit science- 
career advocacy organization in Cambridge, 
UK. But Vitae head Janet Metcalfe warns 
that job satisfaction isn’t always a sign of a 
positive working environment. “Researchers 
love doing research and therefore can have 
high job satisfaction, but they can still be 
experiencing high levels of stress and poor 
well-being,” she says. 


SALARY 

Questions about salary revealed a deeper 
divide between sectors. Fifty-nine per cent of 
respondents in industry said that they were 
happy with their salaries; by comparison, 
only 40% of respondents in academia, 41% in 
non-profits and 49% in government said that 
they were happy with their pay. Overall, 43% 
of respondents said that they were happy with 
their salary. Just over half of all respondents 
reported a recent pay rise, but it clearly wasn't 
enough to erase all disappointment. 

Sam Proskin, a senior geotechnical engineer 
with Thurber Engineering in Calgary, Canada 
is pleased with his salary and career choices. 
He had hoped to land a job in academia as 
he finished his PhD in geotechnical engineer- 
ing at the University of Alberta in Edmonton, 
Canada, in the mid-1990s. But after testing the 
academic waters in both the United States and 
Canada, and finding few opportunities and 
lots of stories of pressure and competition, he 


Mariana Pacheco Blanco enjoys academic research, despite funding struggles. 


decided that a career in consulting would bea 
better fit for his skill set and ambitions. 

Now, Proskin is ina position to offer advice 
to other young engineers and geologists who 
are pondering their futures. He encourages 
them to keep their options open and avoid 
the academia-or-nothing mindset. “Maybe 
it’s time to switch that thinking around,” he 
says. “The default mode should be that you 
are going into industry unless you’re very 
academically inclined.” 

Mariana Pacheo Blanco, a postdoctoral 
researcher at BIOCEV, a biotechnology and 
biomedical academic research centre in Vestec, 
Czech Republic, is happy with her job in aca- 
demia. But she also has a major complaint: she’s 
disappointed with 


the funding oppor- “The default 
tunities. “I spent five mode should 
years in Germany bethat you 
where theres alot of are going into 
funding for science? industry unless 
says Blanco, who is you’re very 
originally from Mex- academically 


ico but earned her 
PhD at the Univer- 
sity of Munster in Germany. “The difference 
between Germany and the Czech Republic is 
considerable.” 

As a postdoc struggling to get by, she’s part 
of another great divide that might be even 
more fundamental than industry versus aca- 
demia: the gap between the haves and the 
have-nots (see ‘Different paths’). Just 5% of 
respondents reported a salary of more than 
US$150,000 per year. Nearly 30% reported a 
salary between $50,000 and $80,000, and close 
to 25% said that they earned between $30,000 
and $50,000. At the other end of the scale, 
11% reported earning between $15,000 and 
$30,000, and 12% didn’t earn even that much. 

Job titles matter when it comes to salary. 
Although a few professors, managers and 
research directors reported earning less than 


inclined.” 
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$15,000, that end of the salary range was dom- 
inated by teachers. About 50% of respondents 
who said that they are mainly teachers earned 
less than $30,000, and nearly 30% of research 
or staff scientists were at the same modest 
place on the salary scale. The upper ends of the 
scale were populated mostly by full professors, 
managers and research directors. 

As with the 2016 salary survey, geography 
proved to bea strong determinant of salaries. 
Nearly 40% of respondents in Asia reported 
earning less than $15,000 a year, compared 
with 2% of respondents in North America. At 
the top level, 11% of all respondents in North 
America and Australasia reported earning 
more than $150,000, putting them far ahead 
of other regions. Pockets of hardship con- 
tinue to persist in Europe. Just over 20% of 
respondents in Europe reported earning less 
than $30,000 a year compared with just 5% 
of those in North America — a gap that is 
unchanged from our 2016 survey. 

The survey also reflected gender disparities 
in salary, especially in scientists who had been 
in their profession for many years. Among 
respondents who said that they were in the 
later part of their career, 33% of men reported 
earning more than $110,000 a year, but only 
23% of women reached that level. For entry 
level, early-career and mid-career respond- 
ents, income brackets were fairly evenly split 
between the genders. Women were also more 
likely than men to report being unhappy with 
their salaries (59% compared with 53%). 

The question “Are you happy with your sal- 
ary?” turned out to be an exercise in relativism. 
More than 20% of respondents who earned 
more than $150,000 a year said that they were 
unhappy with their salaries, whereas 27% 
of people who earned between $15,000 and 
$30,000 a year reported that they were happy 
with their lot, probably because that range fit 
their expectations and cost of living. 

Shrisha Rao, a computer scientist at the 
International Institute of Information Tech- 
nology in Bangalore, India, hasn’t reached a 
lofty income bracket, but is happy with his 
salary and gratified with his work. “I’m for- 
tunate to be at an institution that values me,” 
he says. “I'm not making more than people in 
the United States, but my income is high for 
my country and my profession” 


JOB PATH 

Still, unsurprisingly, academia remains a 
popular destination: nearly three-quarters 
(70%) of respondents said that it had been 
their main goal as they finished their PhDs, 
which is in line with the aspirations of stu- 
dents who responded to Nature's 2017 gradu- 
ate-student survey (see Nature 550, 549-552; 
2017). Blanco, who studies non-Hodgkin lym- 
phoma, says that she is highly satisfied with 
her job, mainly because she is finds her topic 
compelling. “I’m working on cancer, which 
makes the science more exciting,” she says. 
“It’s the hot spot everywhere.’ But she gives > 
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Different paths 


Nature’s 2018 survey of salary and job satisfaction, which drew more than 4,300 responses from 
science professionals around the world, found disparities across the science spectrum. Gender, 
geography and area of employment — academia, industry, government or non-profit — all help to 
shape career outcomes. Science offers many paths, but some are more rewarding than others. 
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Q | Job satisfaction levels in different countries 


Netherlands 80% (44 respondents) 
China 75% (61) 
United Kingdom 72% (358) 
India 71% (266) 
Japan 70% (78) 
United States 68% (1,399) 


Italy 64% (160) 


Germany — 60% (229) 


Brazil — 54% (61) 


Switzerland -| 49% (59) 


Differences in funding, job opportunities and 
local politics can make a scientific career more 
satisfying in some countries than others. 


0 | How satisfied are you with your current job? 
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educational-research agency Shift Learning. *Average for career stage. 
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| What is the biggest challenge to your 
career progression? 
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> even more credit to her supervisor, cancer 
biologist Ondrej Havranek, who, is generous 
with his time and advice, and doesn’t demand 
long hours from his team. “If I want to leave 
at 5 p.m., I can leave at 5 p.m.,” she says. “He 
respects my private life, which can be a prob- 
lem for some people in academia.” 

Rao, who earned his PhD at the University 
of Iowa in 2005, says that he has had his sights 
set on academia since the early days of his 
science training. As a professor at a technol- 
ogy institute where he studies cloud comput- 
ing and resource utilization, he feels that the 
plan paid off. “I enjoy a lot of academic free- 
dom, he says. “People aren't telling me what 
to do on a regular basis.” Still, he says, “My 
peers and I would be happier if we had more 
resources. India punches below its weight 
when it comes to science. That’s a hard fact.” 


SATISFACTION 

Finding enough hours in the day to main- 
tain both a career and a personal life can be a 
challenge no matter where a scientist works. 
Although most respondents were satisfied 
with their own work-life balance, this is 
another area where industry seemed to have 
an edge over academic life: 79% of respond- 
ents in industry said that they were somewhat 
or extremely satisfied with this aspect of their 
career, compared with 68% of respondents in 
academia. 

Job satisfaction is a multifaceted matter. 
When asked to identify the factor that was 
most important to satisfaction, respondents 
put ‘interest in the work at the top of the list. 
That factor also ranked highest in terms of 
actual satisfaction — a happy case in which 
something that’s deemed important actually 
delivers. Still, other aspects of being a scientist 
are dragging respondents down. Survey par- 
ticipants were generally unsatisfied with their 
ability to influence decisions that affect them, 
job security, career-advancement opportuni- 
ties and recognition for achievements — all 
things that they felt were important in making 
a job worthwhile. 


WELL-BEING 
Respondents were also frank about the nega- 
tive effects of their work on their mental 
health. Sixteen per cent of those surveyed 
said that they had either received help or were 
currently getting help for depression or anxi- 
ety. Seventeen per cent said that they had not 
received help but would like to. And 3% had 
sought help but had yet to receive it. Those 
responses follow a pattern of widespread 
unease in science. In Nature’s 2017 graduate- 
student survey, 12% of respondents said that 
they had sought help for anxiety or depression 
that was directly caused by their studies. 
Instances of harassment and discrimination 
— problems that remain stubbornly common 
in science — can, of course, also undermine 
researchers job satisfaction or ability to per- 
form their job. More than one-quarter (28%) of 


rats i ak 


respondents reported observing such problems 
at their current job, and more than one-fifth 
(21%), said that they had personally experi- 
enced such treatment. Of those who said that 
they had witnessed or experienced some sort 
of discrimination, nearly half (47%), said that 
they had experienced gender discrimination, 
the most common type. Ninety-one per cent 
of respondents who said that they had person- 
ally experienced gender discrimination were 
female. Discrimination based on age (23%) or 
race (22%) were also relatively common. 

About half of all respondents felt that their 
workplace is doing enough to promote diver- 
sity. Those working in industry (58%) were 
more likely than those in academia (50%) to 
say that their institution is on top of the issue. 
Hannah Murfet, a quality compliance manager 
at Microsoft Research in Cambridge, UK, gives 
her employer high marks. “Where I work now, 
the mission is very centred on diversity and 
inclusion,’ she says. 

Murfet, who is an advocate for women in 
science, helped to form the Next Generation 
Network, a group dedicated to helping young 
researchers learn 
about careers in com- 
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great if more people 
could consider this opportunity,’ she says. 
Most of all, she wants to encourage young peo- 
ple to keep their minds open to a wide range 
of options. “If you’re interested in industry, 
find an opportunity to get in — it can take you 
where you want it to,’ she says. “You don’t have 
to stay as a bench scientist. You can move into 
marketing, sales or compliance.” 

Nearly 60% of respondents feel positive 
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Aaron Pan switched from the academic track to a curatorial role. 


about their future job prospects — a rate that 
hasn't changed much from the 2016 sur- 
vey — but that optimism is not evenly dis- 
tributed. Scientists were more likely to have 
a rosy outlook if they had a full-time job and 
were under the age of 40 and male. The 25% 
with a negative outlook were more likely to be 
female and to have a temporary contract. On 
another pessimistic note, more than half (51%) 
of respondents said that their job prospects are 
worse or much worse than they were for pre- 
vious generations. Still, a full 75% of respond- 
ents said that they would recommend a career 
in scientific research to students — a marked 
increase from the 61% who took that position 
in the 2016 survey. 

One of those science supporters is Aaron Pan, 
executive director of the Don Harrington Dis- 
cover Center, a science museum in Amarillo, 
Texas. Pan started a postdoc at Southern Meth- 
odist University in Dallas, Texas, after earning a 
PhD in palaeobotany, and was planning to con- 
tinue on the academic path until another option 
opened up. He applied for a curator position at 
the Fort Worth Museum of Science and History 
in Texas, and his career trajectory changed for- 
ever. “I think I would have been happy on many 
different paths,” he says. “Here, I still get to do 
research, but it’s research I want to do. I don't 
have to publish just to get tenure.” 

Whether in academia, industry, non-prof- 
its or government, there are lots of places to 
do science, and lots of ways to be a scientist. 
The Nature survey highlights the diversity 
of options, but it also points to issues that all 
researchers should keep in mind as they plot 
their course. From salary to job satisfaction, 
many things can go right — but they might 
not. The good news is that science, for most, 
will always be interesting. And that can be just 
enough to keep a person going. = 


Chris Woolston is a freelance writer in 
Billings, Montana. 
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wo days ago it had been 

| “we dont negotiate with 

terrorists’, but it was diffi- 

cult not to negotiate with people 
whod stolen the Sun. 

In the corner, the TV droned 
quietly. Most channels weren't 
broadcasting, but the BBC kept 
the news running, interspersed 
with nonsensical cartoons. 
The breaking-news banner 
announced that the prime minis- 
ter, along with other world lead- 
ers, had gone to the mothership 
to negotiate with the Glaosch 
Imperium, who had turned up to 
‘invite’ Earth to join their empire. 

Theyd reacted to our (mostly) 
polite refusal by parking a space- 
ship the size of the Moon between 
us and the Sun. Our follow- 
ing, more explosive, refusal had 
been shrugged off like we were 
mosquitoes. 

“I cant believe youre actually going,” Ryan 
said. “You know what they're doing? Herd- 
ing us into easily managed groups, is what. 
You should come with me to the bunker” 

I put down my water bottle a little too 
firmly. 

“What else are we supposed to do? The 
National Grid can’t cope with the demand of 
this constant darkness, and the demand for 
electricity and gas is only going to increase 
as it gets colder. You want people dying in 
their homes because they don’t want to put 
the heating on? We get enough of that when 
there aren't aliens trying to freeze our asses 
off. People are hurt and scared. They need 
a doctor” 

Ryan sighed and tugged a hand through 
his hair. “Fine. I get it. Just promise me if it 
comes to it, you'll get out and find me.” 

“I promise.” 

“And don’t bring anyone with you. The 
bunker’s set up for eight, we won't be able to 
let anyone else in.” He pulled me into a hug. 
“See you, sis.” 

As I finished packing I wondered how my 
younger brother's weird hobby of filling our 
parents’ shed with bottled water, candles and 
tinned food had turned into part-ownership 

of a nuclear-fallout 


> NATURE.COM shelter. 

Follow Futures: The village hall was 
 @NatureFutures packed with camp 
Ei go.naturecom/mtoodm beds and air beds, 


TOTALITY 


Keeping it cool. 


most occupied by huddled families under 
mounds of blankets. There was only so 
much the ancient heaters could do against 
the plummeting temperature outside. The 
darkness already smelt of too many bodies 
and overworked electrics. I made my way to 
the pool of light coming from the kitchen to 
report for duty. 

It turned out there wasn’t much for me to 
do. People were quiet, subdued, panic long 
since burnt out. I passed out plasters and 
prescribed sleeping tablets and an emer- 
gency inhaler, which the village pharmacist 
dispensed from a suitcase. 

Mostly we just watched the news. And the 
cartoons. 

The content was bland and repetitive, 
obviously designed to stop people panicking 
again. There was nothing about what would 
happen to us if the Glaosch didn't move their 
spaceship. I knew, because Ryan had told me. 
Before long, most of the plant life would be 
dead. Animals would die of the cold if star- 
vation didn’t take them first. Humans might 
hang on longer, but since it would be minus 
100 degrees in a year, it wouldn't be much 
longer. 

I checked my watch. It said ten past one 
but I honestly wasn't sure if that was a.m. or 
p-m.. Or what day it was. The teachers in the 
crowd were pretty good at keeping routine 
for the children, but mostly it was just a blur. 

There was a wave of shushing and I looked 


616 | NATURE | VOL 562 | 25 OCTOBER 2018 


© 2018 Springer Nature Limited. All rights reserved. 


up to see the prime minister on 
the television, looking even more 
grey than usual. She was outside 
Downing Street in the same outfit 
shed been wearing when she left. 

“After a week of difficult 
negotiations,” she said, “it has 
been agreed that Planet Earth, 
henceforth known as Terra, will 
become a member of the Glaosch 
Imperium? 

There was a collective cry — 
grief, relief? I couldn't tell. Beside 
me someone sobbed. “It’s over. It's 
finally over.” 

You don't know how right you 
are, I thought. Ryan had seen it 
coming. 

“They don’t have to invade us,” 
hed said. “All they have to do is 
wait? 

The subtitles stuttered out the 
PM'’s instructions. “Many of you 
are gathered safely in school halls 
and community centres. We ask 
that you remain there so you can 
be processed. If you are at home, please stay 
there; we will find you” 

I checked my pocket for my keys and 
slipped my phone onto the table — Ryan 
had been very clear about not bringing it 
with me, or writing down the bunker’s loca- 
tion. No one paid any attention as I drifted 
towards the exit. I grabbed the pharmacy 
suitcase on the way. It seemed unlikely any- 
one else would need it now. 

Outside, the air felt expectant. I looked up 
at the sky, feeling a wave of dizziness at the 
blackness that spread over it like a blanket. 
No Moon, no stars. They must be right above 
us. 

My car was parked out on the road, on 
double yellow lines, but no one cared about 
things like that any more. It started first 
time for a change, as if it knew this was the 
one that mattered. I threw my satnav out 
the window. As I drove, the sky got lighter, 
daylight creeping in like Ryan after curfew, 
and before long it was like nothing had hap- 
pened. I fumbled sunglasses from the glove 
box, even though the Sun was already slip- 
ping towards the horizon. 

Ishivered, and drove into a tomorrow that 
belonged to the Glaosch Empire. = 


C. L. Holland is a British writer of 
speculative fiction. She has a BA in English 
with creative writing, an MA in English, and 
likes to learn things for fun. 
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UNIVERSIDAD DE LOS ANDES (UNIANDES): 
A UNIVERSITY OF SOCIAL INCLUSION 
AND SUSTAINABLE RESEARCH 


A conversation with , president of Universidad de los Andes (UNIANDES) 


How does UNIANDES 
innovate in education? 
Universidad de los Andes is 

a liberal, non-profit university 
that focuses on well-rounded 
education, with an emphasis 

on humanistic and scientific 
thinking. We are in the process of 
curricular development, focusing 
on intensive research activities 
with professors and students, 
flexibility and interdisciplinarity. 
Reforms will enhance the 
connections between teaching 
and research, promoting 
students’ autonomy. UNIANDES 
is an increasingly international 
university with more than 339 
undergraduate students and 

112 international professors. | 
believe this is the most important 
effort we have recently made to 
improve education. 


How is UNIANDES engaged 
with promoting social 
inclusion? 

We are committed to make 
UNIANDES the most inclusive 
university in Colombia. 
Moreover, our goal is to 
become a blind-admission 
university, benefiting from 

the best minds, regardless 

of economic and social 
conditions. We have a 
vice-presidency role for 
development and alumni, in 
charge of fundraising. We are 
devoted to obtaining funds for 
talented students who cannot 
afford our tuition fees. We 
have created a scholarship 


program called Quiero 
Estudiar, based on the principle 
of reciprocity. The students 
sign a reciprocity agreement 
that, in the future, they will 
help other students to realize 
the same aspirations they are 
being helped to achieve. 


What are the strategies used 
by UNIANDES to achieve 
excellence in research? 

We measure our results 

and impact by international 
standards. We also 

strongly believe in research 
networks, and our most 

cited publications result from 
international collaborations. 
Our office of international 
affairs identifies strategic 
partners worldwide with which 
we promote exchanges. 


How does UNIANDES 
contribute to local or regional 
issues? 

UNIANDES has started to 
work in different regions of 
the country. We renewed 
dialogue with regional 
universities, NGOs, and local 
mayors and governors. We 
aim to collaborate on regional 
strategic planning. We are also 
engaged in strengthening our 
community-based research 

to understand and respond 

to local needs. We signed an 
agreement with the network 
of national natural parks and 
are establishing field stations 
throughout the country. The 


first one we are creating, 

and will offer to the research 
community, is located on the 
Nevados natural park in the 
Paramo, a unique Andean 
ecosystem. These field stations 
will contribute to sustainable 
local development through 
interdisciplinary research. 


What have been some 

of UNIANDES's recent 
collaborations and networks? 
Our most important research 


network is our most recent one. 


We have signed an agreement 
with Pontificia Universidad 

de Catolica Chile and with 
Tecnoldgico de Monterrey. 
We believe that these 
collaborations, in research, 
technological innovations and 
education, will be disruptive 
and innovative. At a local level, 
we have signed significant 
agreements with Agrosavia 

to contribute to Colombia's 
agricultural development. 
Agrosavia is a joint public- 
private, decentralized, not-for- 
profit institution for scientific 
and technical discovery. The 
corporation's purpose is 
generating scientific knowledge 
and developing agricultural 
technologies. We host three 
Max Planck Tandem groups, 
in computational biology, 
computational physics and 
international law. We also 
participate in global initiatives 
in high-energy physics 
through CERN and Fermilab. 


Universidad de los Andes (UNIANDES) is focused on high-quality teaching, social inclusion and 
sustainable research. Publishing more than 1,000 articles in 
from experienced faculty and talented students both at the undergraduate and graduate level. Based 
in Bogota, the university is open to collaboration, shown by 
international networks. University president, Pablo Navas, explains how UNIANDES plans to make 
Colombia a world-class research destination. 
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We host an International 

Mixed Laboratory, funded 

by the French Institute IRD 
(Institut de Recherche pour 

le Développement) and in 
collaboration with the Pontificia 
Universidad Catolica de 
Ecuador (PUCE) to develop 
collaborative research in the 
study and protection of Andean 
ecosystems. We are also 

very proud to host the South 
American center for the study 
of the Sustainable Development 
Goals, one of the Jeffrey Sachs 
initiative centers. 


What are some examples 

of impact case studies 
developed by the university? 
We are strengthening our 
technology transfer strategy. 
We are developing patents 
and products that are being 
transferred to society. Also, 
some of our innovations have 
been adopted by several 
Colombian institutions. For 
example, the classification 
of products developed 

by professors from the 
schools of Architecture 

and Design and Arts and 
Humanities was transferred 
to Colciencias. The first 
digital computer in Colombia 
was installed at UNIANDES, 
revolutionizing research. 
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Student in Biological 


Sciences focusing on 


evolutionary ecology, is one 


of the researchers from 


Universidad de los Andes 


Who described a new 


species of poison frog: 


Andinobates victimatus. 
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UNIVERSIDAD JAVERIANA: INNOVATIVE 
REDEARCTA POR PROGRESS AND PEACE 


A conversation with JORGE HUMBERTO PELAEZ PIEDRAHITA S.J., Rector of Pontificia Universidad 
Javeriana 


What makes Colombia an 
interesting destination for 
researchers from around the 
world? 

The economic stability in Colombia 
has favoured the sustained 
development of high-quality 
research universities. Additionally, 
Colombia is exceptionally well- 
located geographically and is the 
second most biodiverse country 

in the world. A recent peace 
agreement, after 50 years of 
conflict, has made Colombia a 
unique peace-building laboratory 
and has removed barriers for 
conducting research in a broad 
range of subjects. We can 

see from the rising number of 
researchers coming to Colombia 
that the country is an interesting 
hotspot for research during this 


design effective mechanisms to 
reintroduce participants of the 
conflict into civilian life. We lead 
the way in implementing the 
transformation of the civil war 
into a constructive dialogue with 
all members of society. These 
examples demonstrate that the 
peace-building process must 
encompass political, economic, 
social, and environmental 
aspects. To this end, Universidad 
Javeriana has developed strategic 
alliances with local, national, and 
international organizations in 
order to provide local solutions 
with a global scope. 

Furthermore, we highly value 
inter-institutional collaboration and 
multidisciplinary approaches to 
tackle the complex socioeconomic 
challenges faced by Colombia. For 


institutional weaknesses that 
are reflected, for example, in 
the country’s corruption and 
deep economic inequality. 

In many cases, these are a 
consequence of the absence of 
sound ethical values. 

The profound institutional 
transformation that the country 
needs to go through calls for 
creative approaches that will not 
occur spontaneously. This new 
environment requires people 

to be prepared not only with 
the highest levels of education 
but also with solid ethics that 
prioritize human dignity, respect 
for others, and respect for the 
common good. As a Catholic 
and Jesuit university we are 
committed to educating the 
whole person, developing 


Universidad Javeriana is one of the largest and most prestigious universities in Colombia. Founded in 1623, 
it is also one of the oldest educational institutions in South America. Rector Jorge Humberto Peldez S.J. 
discusses how the research agenda at Universidad Javeriana is contributing to a new Colombian society, with 


innovative and interdisciplinary approaches that foster collaboration with other institutions. 


institutions through research. 
This public funding from 
Colombia Cientifica is a 
recognition of our research 
achievements. The first 
program promotes the use 
of Colombia's biodiversity 

to produce phytomedicines 
— medicinal plant-based 
compounds — to treat 
cancer. The second program 
strives to reduce the time for 
producing and optimizing new 
sustainable crop varieties by 
using Omic Sciences (e.g.: 
genomics, metabolomics, 
etc.). The development of 
these programs has fostered 
international collaboration 
with leading global institutions, 
other Colombian universities, 
and institutions in the private 
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The search for innovative answers 
to local challenges is one of our 
main contributions to Colombian 
society. Both research and 
teaching at Universidad Javeriana 
play a transformative role. For 
example, in the field of Health, our 
Faculty of Medicine studies the 
mental health impact of the 
internal conflict on our population. 
Likewise, in the Social Sciences, 
the Institute for Intercultural 
Studies, in our campus in the city 
of Cali, brings together social, 
military, business, and former 
combatant leaders of the FARC 
— in three strategic conflict zones 
in southwestern Colombia — to 


intercultural studies; ecology and 
territory; public health; clinical 

and biostatistical epidemiology; 
infectious diseases; human genetics; 
immunobiology and cell biology; 
water and environmental science 
and engineering; and control 
systems, power electronics 
and management of 
technological innovation. 


Since Universidad Javeriana 
is a Jesuit university, how 

are ethical values important 
for promoting research and 
progress? 

Colombia's social and economic 
progress has been hindered by 
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to promote world-class higher 
education in order to build a 
just, sustainable, inclusive, and 
democratic society. 


Could you mention some 
research highlights that have 
taken place at Universidad 
Javeriana in the last few 
years? 

Universidad Javeriana plays a 
significant role in leading two 
major research programs of 
Colombia Cientifica, the most 
important initiative funded 

by the national government 
to strengthen the quality of 
Colombian higher education 


Institute of Technology, 
University of Illinois, and 
University of Tokyo. We are 
keen on replicating these 
examples of success in further 
research collaborations with 
scholars and institutions from 
around the world. 


Pontificia Universidad 
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PIGS, SEEDS AND 
NEW BETS FOR 
GOOD RESEARCH 


In the quiet south of Colombia, far from the 
capital of Bogota, researchers from Universidad 
del Valle are TAKING ADVANTAGE OF LOCAL 
CREATIVITY AND BIODIVERSITY to establish 
the Valle del Cauca as a regional centre 

of development in agricultural and health 
business. 


At Universidad del Valle 

in Colombia, a group of 20 
scientists led by Professor 
Jose Oscar Gutierrez Montes 
have successfully completed 

a pig-to-pig lung transplant 
using a stem-cell based 
approach. The recipient, called 
Pachorron, received the new 
lung a year ago and has been 
living happily since without the 
need for immunosuppressive 
medication. 

According to Gutierrez “this 
represents a turning point in 
transplantation medicine”. 
Indeed, this achievement relies 
ona recently patented chimaera 
technology that allows organs 
to be prepared so that the 
immune response is suppressed 
once they are transplanted into 
the recipient's body. 

The researchers have since 
developed a related approach 
that could extend these 
techniques to humans and 
help to reduce the enormous 
costs of immunosuppressive 
therapies that today accompany 
transplantation. 

This accomplishment is the 
result of more than two decades 
of multidisciplinary research 
involving local hospitals and 
research groups from health 
and materials engineering 
schools, fostered by the 
vice-presidency of research at 
Universidad del Valle. It follows 
other achievements, including 
a recently patented innovation 
for the preparation of 
nanostructured photo-sensitive 
compounds to help fight cancer 
and an external fixation device 
for bone fracture reduction that 
just hit the market in Colombia, 
Panama and Ecuador. 

Today, the vice-presidency 
for research is building on 
these achievements by 
encouraging young scientists 
and entrepreneurs and follow 
them with dedicated programs, 
convinced that high quality 


research will have a positive 
impact on the region. 


RICH BIODIVERSITY 
In equatorial zones, the climate 
is mild. The countryside 
is green. Sun and rain 
alternate. All this makes for 
a biodiversity hotspot -- and 
a honeypot for researchers 
seeking new microbes. One 
such investigator is David 
Johnston-Monje, the principal 
investigator of a Tandem 
Group with the Max Planck 
Institute for Plant Breeding 
Research. This young scientist 
has pioneered research on 
endophytes - microbes that 
are found inside seeds - which 
were recently shown to have 
a significant impact on plant 
growth, fruit size and flavour. 
Using biotechnology and 
metabolomics, the group is 
tracking plants to study their 
interactions with microbes, 
hoping to unveil beneficial 
organisms. Such research 
has the potential to improve 
the competitiveness of 
the agricultural sector and 
opens new green business 
opportunities with super seeds 
as ameans to address climate 
change and world population 
growth. For the vice-presidency 
of research, this new hire will 
help consolidate the local 
ecosystem for agricultural 
research which also includes 
the recently created Center 
for Research and Innovation in 
Bioinformatics and Photonics 
(CiBioFi) where data analytics 
and photonics are being used as 
a predictive tool for sustainable 
agriculture development. 


Universidad 
del Valle 
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COLOMBIA 


Students and academics march in Bogota, Colombia, to demand more financial resources for higher education. 


Science in Colombia 
on the cusp of change 


The South American country faces deep-rooted problems, but 
scientists are finding reasons to be hopeful. 


usana Fiorentino wants to unlock the 

secrets of Colombian folk medicine. 

The immunologist, who directs the 
Immunology and Cellular Biology group at 
the Pontifical Xavierian University in Bogota, 
Colombia’s capital of 7 million people, is 
working with extracts from South Ameri- 
ca’s anamu (Petiveria alliacea) and divi-divi 
plants, which she thinks can be used to treat 
breast cancer and leukaemia, thanks to their 
anti-tumour properties. 

But while trying to get a phase I trial started 
with an extract from the divi-divi tree, Cae- 
salpinia spinosa, she hit a snag. According 
to Colombia's drug-safety agency INVIMA, 
a molecule that identified the extract had to 


be chemically characterized before the plant 
could be used in a trial. There were no Colom- 
bian laboratories capable of doing the job, so 
she found labs in China and the United States 
that could do the work if she posted samples 
to them. But the request to re-import the 
fully characterized molecule, to be used in 
a Colombia-based trial, was denied with no 
clear reason, says Fiorentino. 

The only way to get that potential medi- 
cine approved would have been to extract, 
characterize, synthesize and manufacture it 
in Colombia, in a laboratory that had not yet 
been built. Fiorentino had to go back to the 
plant and isolate another molecule to use as an 
identifier. She eventually found a laboratory 


at Colombia's University of Antioquia that 
could characterize it, instead of having to look 
abroad for help. 

“It was absolutely ridiculous,” explains 
Fiorentino. “Getting that clinical protocol was 
a path over waterfalls filled with rocks and 
thorns” 

Fiorentino’s experience with Colombia's 
scientific bureaucracy is all too common for 
researchers in a country emerging from a 
half-century of civil war. With a peace accord 
between Marxist rebels and the government 
signed in November 2016, the country is turn- 
ing its attention to building the pillars needed 
to support a strong economy. 

Colombia, the only country in South 
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COLOMBIA 


Q&A 
Species seeker 


During his PhD 
at the University 
of Minnesota in 
Minneapolis, Juan 
Fernando Diaz 
Nieto was part 

of an expedition 
that trekked 
across the tropics 
of South America and discovered a new 
species of mouse oppossum (R. S. Voss 

et al. Am. Mus Novit. 3778, 1-27; 2013). 
He’ since returned to his native Colombia, 
where he is a professor of biology at EAFIT 
University in Medellin. 


What do you study? 

I work with marsupials, rodents and bats. 
I’ve always been interested in fieldwork 
— being able to go to unexplored regions 
that have a high potential for diversity. I 
also work in museums and use genetic 
techniques to investigate diversity. 


What's it like to do fieldwork in Colombia? 
For decades Colombia had zones that 
were too dangerous to go into. The 

2016 peace accord with the FARC, 
which ended the long civil war here, 
helped give access to large regions of 

the country. It’s still complicated: to 
access these areas, one must be careful, 
work out the logistics and sometimes 
ask permission — and not just of the 
government. Many areas have been taken 
over by other armed groups. 


Tell us about a project you recently led. 
In July, I and colleagues led a biodiversity 
project in the Anori region in Colombia, 
which is United Nations-backed and 
involved ex-combatants from the FARC. 
They weren't serving as guides but as 
co-investigators. We planned the project 
with them. After the fieldwork they came 
to our laboratories in Medellin, working 
alongside us. 


What’s the future for science in Colombia? 
There’s a lot of altruism among 
researchers here. One person might go 
into a difficult zone and end up giving 
the world access to a valuable sample. 
There are lots of social issues in our 
country that we have to face. But we are 
willing — our passion will 

guide us. = 


INTERVIEW BY ALESZU BAJAK 
This interview has been edited for length and 
clarity. 
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>» America to be bordered by both the 
Caribbean Sea and the Atlantic Ocean, is a 
mosaic of ecosystems, including mountain 
ranges, deep jungle, rugged coasts and expan- 
sive savannah. It is the second most biodiverse 
nation in the world (behind neighbouring 
Brazil) and, thanks in part to this natural labo- 
ratory, the country of 49 million people has 
dozens of universities and institutes working 
on home-grown science, ranging from the 
International Center for Tropical Agriculture 
in Cauca Valley to the natural and physical sci- 
ence laboratories of Bogotd’s largest universities 
(the city has more than 100 tertiary education 
institutes). 

But the key question being asked by Colom- 
bian scientists is whether science will find sup- 
port in the country’s post-war economy. For 
years, their outlook has been pessimistic in 
the face of strained budgets, meagre resources 
and red tape that stymie the scientific process, 
explains Andrew Crawford, a US biologist 
who, over the course of nine years at Bogotd’s 
Los Andes University, says he has developed 
a sense of the possibilities and challenges of 
doing science in Colombia. 

A reagent order that could be delivered ina 
day in the United States or Europe, for exam- 
ple, can take three months in Colombia, says 
Fiorentino. Monoclonal antibodies, she says, 
have sat for weeks without refrigeration in cus- 
toms, useless by the time they reached the lab. 

“If we do not improve these administrative 
problems,’ Fiorentino stresses, “Colombia will 
never, never be competitive” 


MEAGRE SUPPORT 

Many of these problems, researchers point out, 
are rooted in the government's low financial 
support for science. Colombia currently invests 
only 0.67% of its gross domestic product in 
science and technology, compared with 2.8% 
in the United States. 

Despite this, the country produces on aver- 
age more than 200 high-quality scientific 
studies a year on everything from physics 
to Earth and environmental sciences to the 
life sciences, according to the Nature Index, 
which tracks publication in high-quality sci- 
entific journals. Recent Colombian papers 
have made advances in research on dark mat- 
ter, urban lizards and forest fragmentation. But 
that output trails behind that of other coun- 
tries in the region, such as Brazil and Argen- 
tina, which until recently invested a lot more 
in research. Colombia's spending was only one- 
fifth of Argentina's in 2015, for example. Today, 
however, uncertain political and economic 
conditions in those two countries might offer 
Colombia a chance to catch up. 

Last year, 13 Nobel laureates from around 
the world wrote to then-president Juan 
Manuel Santos, urging him to raise the 
government’s investment. “The Colom- 
bian budget for science and technology for 
2018 continues to be extremely low,” they 
wrote, adding: “The consequences will be 
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devastating and irreversible, because science 
and education are long-term efforts that must 
be supported consistently” 

The letter summarized the growing disen- 
chantment among Colombian researchers 
about the future of science there. As Enrique 
Forero, president of the Colombian Academy of 
Exact, Physical and Natural Sciences in Bogota, 
points out, federal funding is decreasing fur- 
ther. “Support from the government is not very 
acceptable. It’s very, very, very low and it seems 
to be going down every year,’ he says. 


BRAIN DRAIN 

Catalina Pimiento is one Colombian scientist 
who left and never returned. After graduating 
from the Pontifical Xavierian University with 
a degree in biology, she moved first to Mexico, 
then Panama, the United States, Switzerland, 
Germany and now Wales, where she has a 
postdoctoral fellowship at the University of 
Swansea, investigating mass extinctions like the 
one that took out the giant shark Carcharocles 
megalodon. As much as it pains her, she won't 
return to her country of birth. 

“Because of my experiences and the career 
opportunities I’ve had abroad, I decided to 
have nothing to do with Colombia,’ she says. 
“Colombia does not invest in science. It’s that 
simple” 

That's not to say that researchers in Colombia 
cant make a living — or new discoveries — in 
the current climate. Juan Fernando Diaz Nieto, 
a biologist at EAFIT University in Medellin, has 
a successful career identifying new opossums, 
rodents and bats in the country’s hinterlands 
that had for decades been inaccessible because 
of the violence and threat of kidnapping associ- 
ated with the civil war (see ‘Species seeker’). 

“The peace accord helped with access to 
large regions of the country,’ says Diaz Nieto, 
who recently discovered two new marsupial 
species along the river of Colombia's Magda- 
lena basin, in the northwest of the country. 
“We described two new species of mouse oppo- 
sum that are endemic to Colombia, which we 
found on either side of the Magdalena River,’ 
he explains. 

Diaz Nieto, who gained his PhD at the 
University of Minnesota in Minneapolis, says 
he was able to make the discoveries because “in 
lieu of large budgets, we relied on the coopera- 
tion of colleagues at other Colombian institu- 
tions and cobbled together many small grants 
from the National Science Foundation, the 
University of Minnesota and the American 
Society of Mammalogists.” 


SCIENCE TAX 

The Colombian government is trying to get 
better at funding science, however. A law 
passed in 2012 diverted 10% of royalties from 
natural-resource extraction into a science and 
technology fund that was allocated across 
Colombia's 32 government departments. 
The initiative was designed to help stimulate 
science by sponsoring research projects and 
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investing in university research centres in a 
geographically equitable manner. 

Some of it worked. Infectious-disease 
research was funded along Colombia's 
Caribbean coast. In Bogota, Fiorentino received 
more than US$1 million towards her bio- 
pharmaceutical research. And 300 kilometres 
to the west, a new scientific research centre was 
built at the University of Caldas in Manizales. 

But much of the scheme backfired, in part 
because of the bureaucracy the new system 
required. “It was literally impossible to spend 
the money because of the logistics and paper- 
work,’ explains Crawford. “The whole thing 
was designed upside down” 

The nail in the coffin, explains Moises 
Wasserman, former director of the National 
University of Colombia in Bogota, was that 
approval of these scientific projects was 
placed in the hands of local politicians instead 
of scientific review boards. “It was very 
poorly planned, he says. “The decisions and 
a large part of the execution were left to very 
politicized entities who have relatively short 
tenures — governors with four-year terms.” 

Under the auspices of provincial governors, 
millions went to agriculture, aquaculture, 
energy and infrastructure projects that could 
hardly be characterized as science, accord- 
ing to an analysis by investigative reporters 
at La Silla Vacia, a Colombian political news 
website. The convoluted application process 
ultimately slowed the disbursement down so 
much that two of every five pesos in the fund 
went unspent, the investigation claimed. 

Nature approached the Colombian 
government for comment, but did not hear 
back before this story went to press. 


LOOKING AHEAD 

But many Colombian scientists say there's light 
at the end of the tunnel. Last June, the govern- 
ment proposed a bill that will create a Min- 
istry of Science, Technology and Innovation. 


COLOMBIA 


The bill is designed to ensure that science has 
a voice at the highest level of government, 
according to Ivan Dario Agudelo, a senator 
and the bill’s sponsor. 

And last July, a month before now-president 
Ivan Duque Marquez was sworn in, he met 
Forero and 30 other Colombian scientists at 
the National Academy of Sciences in Bogota to 
discuss the future of science in Colombia. By 
the end of the meeting Duque had pledged, on 
Twitter, to restructure how science was admin- 
istered in the country. 

“Things seem to be changing,” says Forero. 
“We hadn't had a president in the academy 
headquarters for 200 years. Or a president- 
elect. We're not completely invisible any more. 
People know that we exist.” 

Some Colombian researchers aren't waiting 
for the government to increase its investment 
in science or build a new ministry. Car- 
los Guarnizo, an ecologist at the Los Andes 
University, says there are things that indi- 
vidual scientists can do to grow science in the 
country. Guarnizo and his colleagues started 
Ciencia Sumercé, a science-communication 
initiative that includes live events at a restau- 
rant in central Bogota and regularly draws 
crowds of hundreds, eager to learn about arti- 
ficial intelligence, climate change or urban 
ecology. 

One priority is keeping the panellists as 
diverse as he can find. “We want to show that 
researchers are not just bald old men but ordi- 
nary people, of any age, of any gender,’ says 
Guarnizo, who hopes his events and accom- 
panying social media activity will draw young 
people to science. 

In that way, he hopes to grow a new genera- 
tion of science-savvy Colombians. “That could 
have untold consequences for our country.” = 


Aleszu Bajak teaches journalism at 
Northeastern University in Boston, 
Massachusetts. He grew up in Bogota. 


Ascience café in Manizales, Colombia, discusses how biotechnology might help local coffee growers. 
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